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ID-65 



Clone 3-60 

5 

(SEQ ID NO: 1) 

GTGTTTATGATGAAAAAAGGACAAGTAAATGATACTAAGCAA 

TCTTACTCTCTACGTAAATATAAATTTGGTTTAGCATCAGTAA 

TTTTAGGGTCATTCATAATGGTCACAAGTCCTGTTTTTGCGGA 

1 0 TC A A ACTAC ATCGGTTC AAGTTAATA ATC AG AC AGGC ACTAG 
TGTGGATGCTAATAATTCTTCCAATGAGACAAGTGCGTCAAGT 
GTGATTACTTCCAATAATGATAGTGTTCAAGCGTCTGATAAAG 
TTGTAAATAGTCAAAATACGGCAACAAAGGACATTACTACTC 
CTTTAGTAGAGACAAAGCCAATGGTGGAAAAAACATTACCTG 

1 5 AAC AAGGGAATTATGTTTAT AGC AA AG A AACCGAGGTGAA AA 
ATACACCTTCAAAATCAGCCCCAGTAGCTTTCTATGCAAAGA 
AAGGTGATAAAGTTTTCTATGACCAAGTATTTAATAAAGATA 
ATGTGAAATGGATTTCATATAAGTCTTTTGGTGGCGTACGTCG 
ATACGCAGCTATTGAGTCACTAGATCCATCAGGAGGTTCAGA 

20 GACTAAAGCACCTACTCCTGTAACAAATTCAGGAAGCAATAA 
TCAAGAGAAAATAGCAACGCAAGGAAATTATACATTTTCACA 
TAAAGTAGAAGTAAAAAATGAAGCTAAGGTAGCGAGTCCAA 
CTCAATTTACATTGGACAAAGGAGACAGAATTTTTTACGACC 
AAATACTAACTATTGAAGGAAATCAGTGGTTATCTTATAAATC 

25 ATTCAATGGTGTTCGTCGTTTTGTTTTGCTAGGTAAAGCATCTT 
CAGTAGAAAAAACTGAAGATAAAGAAAAAGTGTCTCCTCAAC 
CACAAGCCCGTATTACTAAAACTGGTAGACTGACTATTTCTAA 
CGAAACAACTACAGGTTTTGATATTTTAATTACGAATATTAAA 
GATGATAACGGTATCGCTGCTGTTAAGGTACCGGTTTGGACTG 

30 AACAAGGAGGGCAAGATGATATTAAATGGTATACAGCTGTAA 
CTACTGGGGATGGCAACTACAAAGTAGCTGTATCATTTGCTG 
ACCATAAGAATGAGAAGGGTCTTTATAATATTCATTTATACTA 
CCAAGAAGCTAGTGGGACACTTGTAGGTGTAACAGGAACTAA 
AGTGACAGTAGCTGGAACTAATTCTTCTCAAGAACCTATTGA 

35 AAATGGTTTACCAAAGACTGGTGTTTATAATATTATCGGAAGT 
ACTGAAGTAAAAAATGAAGCTAAAATATCAAGTCAGACCCAA 
TTTACTTTAGAAAAAGGTGACAAAATAAATTATGATCAAGTA 
TTGACAGCAGATGGTTACCAGTGGATTTCTTACAAATCTTATA 
GTGGTGTTCGTCGCTATATTCCTGTGAAAAAGCTAACTACAAG 

40 TAGTGAAAAAGCGAAAGATGAGGCGACTAAACCGACTAGTTA 
TCCCAACTTACCTAAAACAGGTACCTATACATTTACTAAAACT 
GTAGATGTGAAAAGTCAACCTAAAGTATCAAGTCCAGTGGAA 
TTTAATTTTCAAAAGGGTGAAAAAATACATTATGATCAAGTGT 



TAGTAGTAGATGGTCATCAGTGGATTTCATACAAGAGTTATTC 
CGGTATTCGTCGCTATATTGAAATTTAA 



5 (SEQ ID NO: 2) 

MFMMKKGQVNDTKQSYSLRKYKFGLASVILGSFIMVTSPVFADQTTSVQVNN 
QTGTS VD ANNS SNETS AS S VITSNNDS VQ AS DK V VNS QNT ATKDITTPLVETK 
PMVEKTLPEQGNYVYSKETEVKNTPSKSAPVAFYAKKGDKVFYDQVFNKDN 
VKWISYKSFGGVRRYAAIESLDPSGGSETKAPTPVTNSGSNNQEKIATQGNYT 

1 0 FSHKVEVKNEAKVASPTQFTLDKGDRIFYDQILTffiGNQWLS YKSFNGVRRFV 
LLGKASSVEKTEDKEKVSPQPQARITKTGRLTISNETTTGFDILITNIKDDNGIA 
AVKVPVWTEQGGQDDIKWYTAVTTGDGNYKVAVSFADHKNEKGLYNIHLY 
YQEASGTLVGVTGTKVTVAGTNSSQEPffiNGLPKTGVYNIIGSTEVKNEAKISS 
QTQFTLEKGDKINYDQVLTADGYQWISYKSYSGVRRYIPVKKLTTSSEKAKD 

1 5 EATKPTS YPNLPKTGTYTFTKT VDVKSQPKVSSPVEFNFQKGEKIHYDQVLVV 
DGHQWISYKSYSGIRRYIEI* 



Sequence description 
20 A) Length: 1642 bp - 547 aa (full length gene) 

B) Sequence Characteristics: 
Potential leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgarno sequence. 

25 

ID-66 
Clone 3-5 

30 (SEQ ID NO: 3) 

ATGATATTGAGACGTCGAACTATTGTTTTATGGCAACTGGGTATCGCCATT 
TCTCTCATTCTTAGTATTCTAGCCTTAAATCTTTATTTCCATAGTACTCCCTT 
GCAAACCAATGCAGCTTTACGGAACCTTGCTCCTTCATTAAACCATCTTTT 
TGGGACAGATGGTTTAGGTAGGGATATGTTTGTCAGAACGATTAAAGGAC 

35 TTTATTTCTCTCTACAAGTCGGCTTATTAGGTGCCCTTATGGGGGTCATTCT 
GGCGACAGTTTTTGGAGTGCTTGCAGGTTTAGGAAATAGCATTATTGATAA 
AATAATAGCATGGTTAGTTGATTTGTTTATTGGTATGCCTCATTTGATTTTT 
ATGATTCTCATTTCTTTTGTTGTTGGGAAAGGTGCTCAAGGGGTCATCATT 
GCAACGGCTGTTACACATTGGCCTTCTTTAGCAAGGCTTATCCGCAATGAA 

40 GTCTATCATCTAAAGAATAAAGAATTTGTCCAACTTTCTAAAAGTATGGGA 
AAAACGCCTTATTATATTGTGAGGCATCATATCCTGCCTTTGATTGCTTCTC 
AAATTTTCATTGGTTTTATCCTCTTATTTCCACATGTCATCCTACATGAAGC 
ATCAATGACTTTCTTAGGATTTGGGCTCTCTGCCGAACAACCTTCGGTTGG 
TATCATTCTGTCAGAGGCAGCTAAGCATATCTCTCTTGGAAATTGGTGGTT 



r 



GGTTATCTTTCCAGGACTTTATCTTATTTTGGTTGTCAATGCATTTGATACT 
ATCGGAGAATCTTTAAAGAAACTCTTTTACCCTCAAACTGATCATTTTTAG 



5 (SEQ ID NO: 4) 

MILRRRTIVLWQLGIAISLILSILALNLYFHSTPLQTNAALRNLAPSLNHLFGTD 
GLGRDMFVRTIKGLYFSLQVGLLGALMGVILATVFGVLAGLGNSIIDKIIAWL 
VDLFIGMPHLIFMILISFVVGKGAQGVIIATAVTHWPSLARLIRNEVYHLKNKE 
FVQLSKSMGKTPYYIVRHHILPLIASQIFIGFILLFPHVILHEASMTFLGFGLSAE 
1 0 QPS VGIILSE AAKHISLGNWWLVIFPGLYLILV VN AFDTIGESLKKLF YPQTDHF 



Sequence description 

15 

A) Length: 822 bp - 274 aa (full length gene) 

B) Sequence Characteristics: 
Potential leader peptide sequence 
Orf is preceded by a potential Shine- 

20 Dalgarno sequence. 

ID-78 



25 Clone 3-5b 

(SEQ ID NO: 5) 

ATGACAGAAACATTATTAAGCATTAAAGACCTCTCCATCACCTTCACTCAA 
TACGGAAGATTTTTAAAACCATTTCAATCAACACCGATACAAGCGCTGA 

30 ATTTAGAAATTAAAAAAGGTGAGTTATTAGCTATTATAGGTGCTAGTGGTT 
CGGGGAAGAGTTTATTAGCACATGCTATTATGGATATTCTTCCTAAAAATG 
CATCTGTAACAGGAGATATGATTTATCGTGGTCAATCACTAAATTCTAAAC 
GCATTAAACAGTTGCGAGGAAAAGATATTACGTTGATTCCACAATCAGTT 
AATTATTTAGATCCATCTATGAAAGTCAAACATCAGGTGCGCTTAGGTATC 

35 TCAGAAAATTCAAAGGCTACTCAAGAAGGATTGTTTCAACAGTTTGGTTTA 
AAAGAAAGTGATGGTGACTTGGATCCTTTCCAACTTTCTGGCGGAATGCTC 
CGACGTGTTTTGTTTACAACGTGTATTAGTGATAAGGTTTCTTTGATTATTG 
CGGATGAGCCCACCCCTGGATTACATCCAGATGCTCTGCAAATGGTTTTAG 
ACCAACTACGCTCCTTTGCAGATAAAGGAATAAGCGTTATATTTATCACTC 

40 ATGATATTGTAGCAGCTAGTCAAATTGCTGATCGTATTACTATTTTTAAAG 
AGGGAAAAGCTATTGAAACAGCTCCAGCTAGTTTCTTTAGCGGAAATGGA 
GAGCAGTTACAAACAGAATTTGCTAGAAGTTTATGGCGCTCTCTCCCACAG 
CAAGAATTTTTGAAAGGAGTTACTCATGACCTTAGAGGCTAA 



(SEQ ID NO: 6) 

MTETLLSIKDLSITFTQYGRFLKPFQSTPIQALNLEIKKGELLAIIGASGSGKSLL 
AHAIMDILPKNASVTGDMIYRGQSLNSKRIKQLRGKDITLIPQSVNYLDPSMK 
VKHQVRLGISENSKATQEGLFQQFGLKESDGDLDPFQLSGGMLRRVLFTTCIS 
5 DKVSLIIADEPTPGLHPDALQMVLDQLRSFADKGISVIFrrHDIVAASQIADRITI 
FKEGKAIETAPASFFSGNGEQLQTEFARSLWRSLPQQEFLKGVTHDLRG* 



10 Sequence description 

A) Length: 804 bp - 268 aa (full length gene) 

B) Sequence Characteristics: 

No obvious leader peptide sequence 
15 Orf is preceded by a potential Shine- 

Dalgarno sequence. 

This gene was not isolated using the LEEP 
system. However in determining a full length 
gene sequence for ED-76, this gene was 
20 identified downstream and fully sequenced. 

ID-79 



25 Clone 3-5c 

(SEQ ID NO: 7) 

GTCCATCTGGGGTGGTTCCCGATTGGTATTTCTTCTCCGATAGGTACTTTGA 
GTCAAGATATTACGTTAGCTGATCGTATTAAGCACCTTATTTTACCTGTTTT 

30 CACGGTAAGTATTCTAGGCATTGCCAATGTAACTCTTCATACTAGAACTAA 
AATGATGTCGGTACTTTCTAGTGAATATGTCTTATTTGCCAGAGCGCGTGG 
GGAAACGGAATGGCAAATTTTTAAAAATCATTGTCTTAGAAATGCTATCGT 
ACCAGCTATTACACTGCATTTTTCCTATTTTGGAGAATTGTTTGGAGGATCC 
GTTCTTGCTGAGCAAGTTTTCTCATATCCAGGACTAGGGTCTACCCTAACT 

35 GAAGCAGGACTTAAAAGTGATACACCGCTACTTCTAGCTATTGTGATGATA 
GGGACATTATTTGTTTTTGCGGGCAATCTTATTGCGGATATTTTAAATAGC 
ATAATCAATCCACAGTTAAGGAGAAAAGTATGA 

(SEQ ID NO: 8) -• • - 

40 VHLGWFPIGISSPIGTLSQDITLADRIKHLILPVFTVSILGIANVTLHTRTKMMSV 
LSSEYVLFARARGETEWQIFKNHCLRNAIVPAITLHFSYFGELFGGSVLAEQVF 
SYPGLGSTLTEAGLKSDTPLLLAIVMIGTLFVFAGNLIADILNSIINPQLRRKV* 

Sequence description 



I 



A) Length: 495 bp - 165 aa (partial gene sequence) 

B) Sequence Characteristics: 
N-terminus has yet to be determined. 

5 This gene was not isolated using the LEEP 

system. However in determining a full length 
gene sequence for ID-76, this gene was 
identified upstream. 

10 

ID-80 
Clone 2-17 

15 (SEQ ID NO: 9) 

TTGCGGACAATTACGTTCAAACACAATGAAACGCGATCGTCAAAAAGCGA 
AGGTAGGGCGGTAATGCTTAAAAGATTATTTACTGAAGATGGGGAATTGA 
CAAAGATTAGTCGTCGTTTCGTTTGGATGTTAGTGGTTATCTATTGTCTTAT 
TATTGTCAGGATGTGTTTTGGGCCTCAAATTATGATTGAGGGGGTATCAAC 

20 TCCGAATGTTCAGCGCTTCGGAAGAATTGTAGCTCTTTTAGTACCATTTAA 
TTCTTTTCGTAGTTTAGATCAGCTAACTAGCTTTAAAGAGATTTTTTGGGTT 
ATTGGTCAAAATGTAGTGAATATTTTACTGCTGTTTCCTCTCATTATAGGGT 
TACTATCCCTAAAGCCAAGTTTACGGAAATATAAAAGCGTTATATTACTTG 
CTTTCTTGATGTCTCTTTTCATAGAGTGTACTCAAGTTGTTTTAGATATTTT 

25 AATAGATGCTAATCGGGTTTTTGAAATCGACGATCTATGGACAAATACCTT 
AGGCGGTCCTTTCGCCCTATGGAGTTATCGAAACATAAAAGGTTGGCTTCT 
AACTATTAGAAAATGA 

(SEQ ID NO: 10) 

30 MRTITFKHNETRSSKSEGRAVMLKRLFTEDGELTKISRRFVWMLVVIYCLIIVR 
MCFGPQIMIEGVSTPNVQRFGRIVALLVPFNSFRSLDQLTSFKEIFWVIGQNVV 
NILLLFPLIIGLLSLKPSLRKYKSVILLAFLMSLFIECTQVVLDILIDANRVFEIDD 
LWTNTLGGPFALWSYRNIKGWLLTIRK* 

35 

Sequence description 

A) Length: 579 bp - 193 aa (full length gene) 

- B) Sequence Characteristics: 

40 Possesses a potential leader peptide sequence 

No obvious Shine-Dalgarno, but the 'TTG' codon 
may not be the actual translation start point. 
A methionine (ATG) that occurs -22 codons 
downstream of the 'TTG' is preceded by a 



potential Shine-Dalgarno sequence and may 
represent the actual start codon. 



ID 81 

5 

Clone 3- 1 
(SEQIDNO: 11) 

TTGAAAAATTTAAATCGTTATGTAGTTGCGGTTTCTGGAGTCGTTTTACATT 

1 0 TAATGCTAGG ATC A ACTTATGCTTGG AGTGTGTTTCGT AACCC A ATTATCT 
CAGAGACTGGTTGGGATATTTCATCAGTTTCATTCGCTTTTAGTTTGGCTAT 
TTTTTGTCTAGGAATGTCTGCAGCTTTTATGGGACACTTAGTAGAGCGTTTT 
GGTCCTAGGATAATGGGAATGATTTCTGCTATTTTATATGGAGCAGGGAAT 
GTGTTAACAGGCTTAGCCATTGAAACTCAGCAGTTATGGTTACTGTATGTT 

1 5 GC ATACGGTATTTTAGG AGGA ATCGG ACTTGGTTC AGGTTATATT ACTCC A 
GTATCGACTATTATTAAATGGTTTCCTGATAGGAGGGGACTAGCAACAGG 
ATTCGCTATTATGGGATTTGGCTTTGCTTCTTTAGTAACAAGTCCGCTTGCA 
CAATCCTTACTGATTAGGATTGGTGTGGGTAAAACGTTTTATATTTTGGGA 
TTAGTATATTTTTTTGTCATGATGATTGCCTCACAATTTATTAAACAACCAC 

20 CTCAGGAAAAAATAACTATTTTGACTCACGATGGTAAAAAGAATGCTATG 
AATTCACAAATTATCACTGGATTAAAAGCAAACGTCGCTATAAAATCAAA 
AACCTTTTACATCATTTGGTTGACCTTGTTTATTAATATTTCGTGTGGCTTA 
GGTTTAATATCAGCAGCTTCACCAATGGCACAAGATTTAGCAGGCTATTCC 
GCAGAATCTGCAGCCTTATTAGTAGGGGTACTAGGGATATTTAACGGTTTT 

25 GGACGTCTGTTATGGGCAAGTCTCTCTGACTACATTGGACGCCCGTTGACC 
TTTATAATATTATTTATTGTGAACTTTATTATGACTTCTAGTTTATTTTTGTC 
ATTCAATGCTATTGTATTTGCAATAGCGATGTCTATTTTAATGACTTGTTAT 
GGTGCAGGTTTTTCCTTATTACCTGCTTATCTAAGTGATATTTTTGGAACAA 
AGGAATTAGCTACTTTACATGGTTATAGTTTAACAGCATGGGCAATAGCAG 

30 GTCTGTTTGGGCCCCTATTGTTATCAAAGACATATTCATGGGGAAATTCCT 
ATCAATTGACATTAATGGTTTTTGGTTTTTTATTCTTATTCGGATTATTGTTA 
TCTCTATATTTAAGAAAATTAACAACTAAAGTTGTGTAG 

(SEQ ID NO: 12) 

35 LKNLNRYVVAVSGVVLHLMLGSTYAWSVFRNPIISETGWDISSVSFAFSLAIFC 
LGMSAAFMGHLVERFGPRIMGMISAILYGAGNVLTGLAIETQQLWLLYVAYG 
ILGGIGLGSGYITPVSTIIKWFPDRRGLATGFAIMGFGFASLVTSPLAQSLLIRIG 
VGKTFYILGLVYFFVMMIASQFIKQPPQEKITELTHDGKKNAMNSQIITGLKAN 
VAIKSKTFYIIWLTLFINISCGLGLISAASPMAQDLAGYSAESAALLVGVLGIFN 

40 GFGRLLWASLSDYIGRPLTFIELFIVNFlMTSSLFLSFNAiVFAIAMSILMTCYGA 
GFSLLPAYLSDIFGTKELATLHGYSLTAWAIAGLFGPLLLSKTYSWGNSYQLTL 
MVFGFLFLFGLLLSLYLRKLTTKVV* 



Sequence description: 



A] Length 1221 bp - 407 a.a (full length 
gene). 

B] TTG start codon with Shine-Dalgarno 
sequence upstream. Obvious signal peptide, 
with hydropathy plot exhibiting many possible 
membrane spanning regions, indicating protein 
to be transmembrane. 



ID-82 
Clone 48 
(SEQIDNO: 13) 

ATGGCAGATAAAAACAGAACATTTAAACTTGTAGGTGCAGGATCTTCTAG 

CACACAAGAAAAAATTGAAAAGCCTGCTCTTTCGTTTATGCAAGATGCGTG 

GCGTCGCTTGAAAAAAAACAAATTAGCAGTAGTTTCACTCTATTTATTAGC 

TCTTTTACTTACTTTTTCGTTAGCCTCAAATTTATTTGTAACTCAGAAGGAT 

GCTAATGGGTTTGATTCGAAAAAAGTAACGACATATCGCAACTTACCACCT 

AAATTGAGTTCAAACCTTCCTTTTTGGAATGGTAGCATTAATCCATCA 

(SEQ ID NO: 14) 

MADKNRTFKLVGAGSSSTQEKIEKPALSFMQDAWRRLKKNKLAVVSLYLLA 
LLLTFSLASNLFVTQKDANGFDSKKVTTYRNLPPKLSSNLPFWNGSINPS 



Sequence description: 

A] Current length is 303 bp - 101 aa 

B] No obvious signal peptide but Shine 
Dalgarno sequence upstream of the ATG start 

codon. Not ide3ntified directly using the LEEP system but was found 
directly downstream of ID-34 described in WO 00/06736. 



ID-83 
Clone 98 
(SEQIDNO: 15) 

ATGAAAATAGTAGTACCAGTAATGCCTCGCAGTCTTGAAGAGGCTCAAGA 
AATAGATTTATCAAAATTTGATAGTGTTGATATTATTGAATGGCGAGCTGA 



TGCCTTACCAAAGGATGACATTATTAATGTAGCTCCAGCTATTTTTGAGAA 
ATTCGCAGGTCATGAAATTATTTTTACTTTTCGTACAACGCGTGAAGGTGG 
TAATATTGTCTTATCTGATGCTGAGTATGTTGAGTTAATCCAGAAAATTAA 
TTCTATCTACAATCCAGATTATATTGATTTTGAGTATTTTTCACATAAAGAA 
5 GTTTTTCAAGAAATGCTAGAATTTCCAAATTTAGTCCTGTCTTATCACAATT 
TTCAAGAGACACCGGAGAATATTATGGAGATATTTTCAGAATTAACAGCC 
CTAGCACCACGAGTTGTGAAAATCGCAGTAATGCCAAAGAATGAACAAGA 
TGTCTTAGACGTTATGAATTACACTCGCGGTTTCAAGACTATTAATCCTGA 
TCAAGTTTATGCGACGGTATCTATGAGTAAAATTGGACGTATTTCTCGTTTT 
1 0 GCTGGTG ATGT A ACTGG ATCT AGTTGG AC ATTTGC AT ATTT AG ATTC ATCT 
ATCGCACCCGGACAAATTACTATTTCAGAGATGAAGCGTGTCAAAGCATT 
GCTTGACGCTGACTGA 



(SEQ ID NO: 16) 

1 5 MKIVVPVMPRSLEEAQEIDLSKFDS VDHEWRAD ALPKDDIINVAPAIFEKFAG 
HEIIFTFRTTREGGNIVLSDAEYVELIQKINSIYNPDYIDFEYFSHKEVFQEMLEF 
PNLVLSYHNFQETPENIMEIFSELTALAPRVVKIAVMPKNEQDVLDVMNYTRG 
FKTINPDQVYATVSMSKIGRISRFAGDVTGSSWTFAYLDSSIAPGQITISEMKRV 
KALLDAD* 



20 



30 



Sequence description: 



A] Length 678 bp, 225 aa (full length gene) 
25 B] No obvious signal peptide, but there is a 

Shine Dalgarno immediately upstream of ORF. 



ID-84 

Clone RS-52 



(SEQ ID NO: 17) 

ATGAAAGACTT ATTTGC AACAACAGAAGCATCATCAAGGAAACAGGAACA 
35 AGATAGAATTGTCAATTACATAAAACAACATGTTGAGTTAACAAATGGTA 
ATCAAATAAAAAAAATTGAGTTTATCGACTTTCAAAAAAATGAGATGACA 
GGTACATGGGGAATTTCTACTAAAATTAATGAACAATTTTCGATTAGTTTT 
TCTGAAGATAGAATTGGTGGTAAACTTAGAGCATTAGGATATCAACCGAA 
TGAAATAGGTTTTTCAAAGGACATCAATAGTAATAATGAAAATGTT-AATGA 
40 TATTGAAGTGATTTATATGAAGAAAGAATAG 



(SEQIDNO: 18) 

MKDLFATTEASSRKQEQDRIVNYIKQHVELTNGNQIKKIEFIDFQKNEMTGTW 

GISTKINEQFSISFSEDRIGGKLRALGYQPNEIGFSKDINSNNQNVNDIEVIYMK 

KE* 

5 

Sequence description: 

A] length: 333 bp - 111 aa (partial sequence) 

B] No obvious Shine Dalgarno sequence upstream 
10 of the ATG start codon, and no obvious signal 

peptide within the protein. 

ID-85 

15 

Clone RS-53 
(SEQ ID NO: 19) 

ATGAAAAAACGTATATGGTATTTGATAATAATAATCACAGTAATTTTAGGA 
20 GGACTAGCCATGAAAAACTTATTTGCAACAACAGAAGCATCATCAAGGAA 
ACAGGAACAAGATAGAATTGTCAATTACATAAAACAACATGTTGAGTTAA 
CAAATGGTAATCAAATAAAAAAAATTGAGTTTATCGACTTTCAAAAAAAT 
GAGATGACAGGTACATGGGGAATTTCTACTAAAATTAATGAACAATTTTCG 
ATTAGTTTTTCTGAAGATAGAATTGGTGGTAAACTTAGAGCATTAGGATAT 
25 CAACCGAATGAAATAGGTTTTTCAAAGGACATCAATAGTAATAATCA 

(SEQ ID NO: 20) 

MKKRIWYLIIIITVILGGLAMKNLFATTEASSRKQEQDRIVNYIKQHVELTNGN 
QIKKIEFIDFQKNEMTGTWGISTKINEQFSISFSEDRIGGKLRALGYQPNEIGFSK 
30 DINSNNQ 

Sequence description: 

35 A] Length: 351 bp - 1 17 aa (Partial sequence) 

B] Obvious signal peptide and Shine Dalgarno 
sequence upstream of the ATG start codon. 

40 

ID-86 



Clone ID-74 



(SEQ ID NO: 21) 

ATGTCAAATCAATATGATTATATCGTTATTGGTGGAGGTAGT 
GCAGGCAGTGGTACCGCTAATAGGGCAGCCATGTATGGAGC 
AAAAGTCCTGTTAATTGAAGGTGGACAAGTAGGTGGAACTTG 
5 TGTTAACTTAGGTTGTGTACCTAAGAAAATCATGTGGTATGG 
TGCACAAGTTTCTGAGACACTCCATAAGTATAGTTCAGGTTA 
TGGTTTTGAAGCCAATAATCTTAGTTTTGATTTTACTACTCTA 
AAAGCTAATCGCGATGCTTACGTGCAGCGGTCTAGACAGTCG 
TATGCCGCTAATTTTGAGCGTAATGGGGTCGAAAAGATTGAT 

1 0 GGATTTGCTCGTTTT ATTG AT AACC AT ACT ATTG AAGTG AATG 
GTCAGCAATATAAAGCTCCTCACATTACTATTGCAACAGGTG 
GACACCCTCTTTACCCTGATATTATTGGAAGTGAACTTGGTG 
AGACTTCTGATGATTTTTTTGGATGGGAGACCTTACCAAATTC 
TATATTGATTGTTGGGGCGGGCTATATCGCGGCAGAACTTGC 

15 TGGAGTGGTTAATGAATTAGGCGTTGAAACCCATCTTGCATT 
TAGAAAAGACCATATTCTACGCGGATTTGATGACATGGTAAC 
AAGTGAGGTTATGGCTGAAATGGAGAAATCAGGTATCTCTTT 
ACATGCTAACCATGTACCTAAATCTCTTAAACGGGATGAAGG 
TGGCAAGTTGATTTTTGAAGCTGAAAATGGGAAAACGCTTGT 

20 CGTTGATCGTGTAATATGGGCTATCGGCCGTGGACCAAATGT 
AGACATGGGACTTGAAAATACCGATATTGTTTTAAATGATAA 
AGATTATATCAAAACAGATGAATTTGAGAATACTTCTGTAGA 
TGGCGTGTATGCTATTGGAGATGTTAATGGGAAAATTGCCTT 
GACACCGGTAGCAATTGCAGCAGGTCGTCGCTTATCAGAAAG 

25 ACTTTTTAATCATAAAGATAACGAAAAATTAGATTACCATAA 
TGTACCTTCAGTTATTTTTACTCACCCTGTAATTGGGACGGTA 
GGACTTTCAGAAGCAGCAGCTATCGAGCAATTTGGAAAAGAT 
AATATCAAAGTCTATACATCAACTTTTACCTCTATGTATACGG 
CTGTTACCAGTAATCGCCAAGCAGTTAAGATGAAGCTCATAA 

30 CCCTAGGAAAAGAGGAAAAAGTTATTGGGCTTCATGGTGTTG 
GTTATGGTATTGATGAAATGATTCAAGGTTTTTCAGTTGCTAT 
CAAAATGGGGGCTACTAAAGCAGACTTTGATGATACTGTTGC 
TATTCACCCAACTGGATCTGAGGAATTTGTTACAATGCGCTA 
A 

35 

(SEQ ID NO: 22) 

MSNQYDYIVIGGGSAGSGTANRAAMYGAKVLLIEGGQVGGTC 
VNLGCVPKKIMWYGAQVSETLHKYSSGYGFEANNLSFDFTTLK 

ANRDAYVQRSRQSYAANFERNGVEKIDGFARFIDNHTIEVNGQ - - 

40 QYKAPHITIATGGHPLYPDHGSELGETSDDFFGWETLPNSILIVG 
AGYIAAELAGVVNELGVETHLAFRKDHILRGFDDMVTSEVMAE 
MEKSGISLHANHVPKSLKRDEGGKLIFEAENGKTLVVDRVIWAI 
GRGPNVDMGLENTDIVLNDKDYIKTDEFENTSVDGVYAIGDVN 
GKIALTPVAIAAGRRLSERLFNHKDNEKLDYHNVPSVIFTHPVIG 



TVGLSEAAAffiQFGKDNIKVYTSTFTSMYTAVTSNRQAVKMKLI 
TLGKEEKVIGLHGVGYGIDEMIQGFSVAIKMGATKADFDDTVAI 
HPTGSEEFVTMR* 

5 

ID-87 

Clone RS-55 

10 (SEQIDNO:23) 

ATGACAAAAAAACATCTTAAAACGCTTGCCTTGGCACTTACTACAGTATCA 
GTAGTGACATACAGCCAGGAGGTATATGGATTAGAAAGAGAGGAATCGGT 
CAAACAAGAACAAACCCAGTCAGCTTCAGAAGATGATTGGTTCGAAGAAG 
ATAATGAGAGGAAAACAAATGTTTCTAAAGAGAATTCTACTGTTGATGAA 

1 5 AC AGTTAGTG ATTT ATTTTCTG ATGGA A AT AGT AATA ACTCT AGTTCT A AA 
ACCGAGTCAGTGGTAAGTGACCCTAAACAAGTCCCCAAAGCAAAACCAGA 
GGTTACACAAGAAGCAAGCAATTCTAGTAATGATGCTAGCAAAGTAGAAG 
TACCAAAACAGGATACAGCTTCAAAAAAGGAAACTCTAGAAACATCAACT 
TGGGAGGCAAAAGATTTCGTAACTAGAGGGGATACTTTAGTAGGTTTTTCA 

20 AAATCTGGAATTAATAAGTTATCTCAAACATCACACTTGGTTTTACCAAGT 
CATGCAGCAGATGGAACTCAATTGACACAAGTAGCTAGCTTTGCTTTTACT 
CCAGATAAAAAGACGGCCATTGCAGAATATACAAGTAGGCTAGGAGAAA 
ATGGGAAACCGAGTCGTTTAGATATTGATCAGAAGGAAATTATTGATGAG 
GGAGAAAT ATTT AATGCTTACCAGTTGACTAAGCTT ACT ATTCCAAATGGT 

25 TATAAGTCTATTGGTCAAGATGCTTTTGTGGACAATAAGAATATTGCTGAG 
GTTAACCTTCCTGAGAGTCTCGAGACTATTTCAGACTATGCTTTTGCTCACA • 
TGTCTTTAAAACAAGTAAAGTTACCAGATAACCTAAAGGTCATTGGAGAA 
TTAGCTTTTTTTGATAATCAGATTGGTGGTAAGCTTTACTTGCCACGTCACT 
TGATAAAATTAGCAGAACGCGCTTTCAAATCTAATCGTATTCAAACAGTTG 

3 0 A ATTTTTGGG AAGT A AGCTT A AGGTT AT AGG AG A AGC A AGTTTTC AAG AT 
AATAATCTGAGGAATGTTATGCTTCCGGATGGACTTGAAAAAATAGAATC 
AGAAGCTTTTACAGGAAATCCAGGAGATGAACATTACAACAATCAGGTTG 
TATTGCGCACAAGGACAGGCCAAAATCCACATCAACTTGCGACTGAGAAT 
ACTTACGTCAATCCGGACAAATCATTGTGGCGTGCAACACCTGATATGGAT 

35 TATACCAAATGGTTAGAGGAAGATTTTACCTATCAAAAAAATAGTGTTACA 
GGTTTTTCAAATAAAGGCTTACAAAAGGTAAGACGTAATAAAAACTTAGA 
AATTCCAAAACAACACAATGGTATTACTATTACTGAAATTGGTGATAACGC 
TTTTCGCAATGTTGATTTTCAAAGTAAAACTTTACGTAAATATGATTTGGA 
AGAAATAAAGCTGCCGTGAACTATTCGGAAAATAGGTGCTTTTGGTTTTGA 

40 ATCTAATAACTTGAAATCCTTTGAAGCAAGTGAAGATTTAGAAGAGATTA 
AAGAGGGAGCCTTTATGAATAATCGTATTGGAACTCTAGACTTGAAAGAC 
AAACTTATCAAAATAGGTGATGCTGCTTTCCAT ATT AATCAT ATTT ATGCC 
ATTGTTCTTCCAGAATCTGTACAAGAAATAGGACGTTCAGCTTTTCGACAA 
AATGGTGCGCTTCACCTTATGTTTATCGGAAATAAGGTTAAAACAATTGGT 



GAAATGGCTTTTTTATCCAATAAACTGGAAAGTGTAAATCTCTCTGAGCAA 
AAACAATTAAAGACAATTGAGGTCCAAGCTTTTTCGGATAATGCCCTTAGT 
GAAGTAGTCTTACCGCCAAATTTACAGACTATTCGTGAAGAGGCTTTCAAA 
AGGAATCATTTGAAAGAAGTGAAGGGTTCATCTACATTATCTCAGATTACT 
5 TTTAATGCTTTTGATCAAAATGATGGGGACAAACGCTTTGGTAAGAAAGTG 
GTTGTTAGGACACATAATAATTCTCATATGTTAGCAGATGGTGAGCGTTTT 
ATCATTGATCCAGATAAGCTATCTTCTACAATGGTAGACCTTGAAAAGGTT 
TTAAAAATAATCGAAGGTTTAGATTACTCTACATTACGTCAGACTACTCAA 
ACTCAGTTTAGAGAAATGACTACTGCAGGTAAAGCGTTGTTATCAAAATCT 

1 0 A ACCTCCG AC A AGG AGAA AA AC AAA A ATTCCTTC A AG A AGCAC AATTTTT 
CCTTGGTCGCGTTGATTTGGATAAAGCCATAGCTAAAGCTGAGAAGGCTTT 
AGTGACCAAGAAGGCAACAAAGAATGGTCATTTGCTTGAGAGGAGTATTA 
ACAAAGCGGTATTAGCTTATAATAATAGTGCTATTAAAAAAGCTAATGTTA 
AGCGCTTGGAAAAAGAGTTAGACTTGCTGACAGATTTAGTCGAGGGAAAA 

1 5 GG ACCATT AGCGC A AGCTACAATGGTAC A AGG AGTTT ATTT ATT A AAGAC 
GCCTTTACCATTGCCAGAATATTATATCGGATTGAACGTTTATTTTGACAA 
GTCTGGAAAATTGATTTATGCACTTGATATGAGTGATACTATTGGCGAGGG 
ACAAAAAGATGCATATGGTAATCCTATATTAAATGTTGACGAGGATAATG 
AAGGTTATCATACCTTGGCAGTTGCCACTTTAGCTGATTATGAAGGTCTTT 

20 ATATTAAAGATATTTTAAATAGTTCCCTTGATAAGATTAAAGCAATACGCC 
AGATTCCTTTGGCAAAATATCATAGATTAGGAATTTTCCAAGCTATCCGAA 
ATGCAGCGGCAGAAGCAGACCGATTGCTTCCTAAGACACCTAAGGGGTAC 
CTAAATGAAGTCCCAAATTATCGTAAAAAACAAATGGAGAAAAATTTAAA 
ACCAGTTGATTATAAAACGCCGATTTTTAATAAGGCTTTACCTAATGAAAA 

25 GGTAGACGGTGATAGAGCGGCTAAAGGTCATAATATAAATGCGGAGACTA 
ATAATTCTGTAGCTGTAACACCAATAAGGTCCGAGCAGCAATTACATAAGT 
CACAGTCTGATGTAAATTTACCTCAAACAAGTTCTAAAAATAATTTTATAT 
ACGAGATTCTAGGATACGTTAGTTTATGTTTGCTTTTCCTAGTAACTGCTGG 
GAAAAAAGGAAAACGAGCAAGAAAATAA 

30 

(SEQ ID NO: 24) 

MTKKHLKTLALALTTVSVVTYSQEVYGLEREESVKQEQTQSASEDDWFEEDN 
ERKTNVSKENSTVDETVSDLFSDGNSNNSSSKTESVVSDPKQVPKAKPEVTQE 
ASNS SND AS K VE VPKQDT AS KKETLETST WE AKDF VTRGDTLVGFS KS GINKL 

35 SQTSHLVLPSHAADGTQLTQVASFAFTPDKKTAIAEYTSRLGENGKPSRLDEDQ 
KEIBDEGEIFNAYQLTKLTIPNGYKSIGQDAFVDNKNIAEVNLPESLETISDYAF 
AHM S LKQ VKLPDNLK VIGELAFFDNQIGGKL YLPRHLIKLAER AFKSNRIQT V 
EFLGSKLKVIGEASFQDNNLRNVMLPDGLEKIESEAFTGNPGDEHYNNQVVLR 
TRTGQNPHQLATENTYVNPDKSLWRATPDMDYTKWLEEDFTYQKNSVTGFS 

40 NKGLQKVRRNKNLEIPKQHNGITITEIGDNAFRNVDFQSKTLRKYDLEEIKLPS 
TIRKIGAFAFQSNNLKSFEASEDLEEIKEGAFMNNRIGTLDLKDKLIKIGDAAFH 
INHIYAIVLPESVQEIGRSAFRQNGALHLMFIGNKVKTIGEMAFLSNKLESVNL 
SEQKQLKTffiVQAFSDNALSEVVLPPNLQTIREEAFKRNHLKEVKGSSTLSQITF 
NAFDQNDGDKRFGKKVVVRTHNNSHMLADGERFIIDPDKLSSTMVDLEKVL 



KlffiGLDYSTLRQTTQTQFREMTTAGKALLSKSNLRQGEKQKFLQEAQFFLGR 

VDLDKAIAKAEKALVTKKATKNGHLLERSINKAVLAYNNSAIKKANVKRLEK 

ELDLLTDLVEGKGPLAQATMVQGVYLLKTPLPLPEYYIGLNVYFDKSGKLIYA 

LDMSDTIGEGQKDAYGNPILNVDEDNEGYHTLAVATLADYEGLYIKDILNSSL 

DKIKAIRQIPLAKYHRLGIFQAIRNAAAEADRLLPKTPKGYLNEVPNYRKKQM 

EKNLKPVDYKTPIFNKALPNEKVDGDRAAKGHNINAETNNSVAVTPIRSEQQL 

HKSQSDVNLPQTSSKNNFIYEILGYVSLCLLFLVTAGKKGKRARK* 



Sequence description: 

A] Length 3168 bp - 1056 aa (Partial sequence) 

B] Obvious signal peptide with Shine Dalgarno 
sequence upstream of the ATG start codon. 



ID-88 

Clone RS-56 
(SEQIDNO:25) 

GCAGGATACATCATGCACAAGCACGAGGCTATCGTGTCATGCTGGGGTCA 
ACCCAGGAAGACATGTCGGCACAAGCTGAAGATTTCTTTACAGTCTGTACA 
CAATAAAGAGACGGGTAAGAGCGCTTTTAATGACAAAGAACGACTAGCAA 
TT 

(SEQ ED NO: 26) 

AGYIMHKHEAIVSCWGQPRKTCRHKLKISLQSVHNKETGKSAFNDKERLAI 



Sequence description: 

A] Length: 153 bp - 51 aa (partial sequence) 

B] No signal peptide visible, insufficient 
sequence data to determine the presence of a 
Shine Dalgarno sequence. 



ID-89 



Clone RS-58 



(SEQ ID NO: 27) 

GTGTCATTTATGCAAAGAAAATCCTATTTAAAATCCATGAGTGTTCTTACT 
TTAACAGCTTGTCTTATATCAGGATATGTGGTTAAAGATATTGCTATGTTA 
CATGCAGTATCTGCCAGTGAGAAGAAAGCAAATAATGTCAGTCCGAGAGA 
5 AAATCTCTACAGGGCTGTCAATGATAATTGGCTAGCCAATACAAAACTCA 
AACAAGGGCAGACTAGTGTTAATAGTTTTTCAGAAATTGAGGATAAATTA 
AAGCAACTGTTAGTGTCTGATATGGCTAAAATGGCCTCAGGAAAGATTGA 
AACAACCAATGATGAACAGAAAAAAATGGTTGCATACTATAAACAAGGTA 
TGGACTTTAAAACAAGAGATAAAAATGGTCTCAAACCTCTAAAACCAGTT 

1 0 TTACA AA A ACTTG A AGC AGTCTCTTCAATG AAAG ACTTTC AA AGTTTGGCC 
CATGATTTTGTGATGAGTGGTTTTGTTTTACCATTTGGTTTGACTGTGGAAA 
CCAATGCTCGAGATAATAGCCAAAAGCAATTGGTGCTTCGTCAAGCACCC 
GCATTACTTGAATCACCTGACCAATATAAGAAGGGCAATAAAGAAGGTGA 
GGCTAAATTATCAGCTTACCGTACTTCAGCAATGGCTTTGCTTAAACAAGC 

1 5 TGG AAAA AGTA AC ATTG AAGAT AG AAA ACT AGTT AA AC AAGCT AT AGC AT 
TTGATAGACTCTTATCAGAAAAAACGCAAGTTGATCAAAGTAAAATCACA 
GCTGAAAGTGAGACAGCTGCGGGGCGATATAACCCTGAAAGTATGGAAAC 
GGTTCACAATTACGCCAAGGAATTTGACTTTAAAGAATTGATTGAAAAACT 
AGTTGGGCCAACGAATAAGGCAGTCAATGTAGAAGATAAAACTTATTTTA 

20 AACAGGTTAATGATGTTATAAATAGTAAACAATTAGCCAATATGAAAGCA 
TGGATGATGATTTCTATGCTAGTTGATCAATCAGATTTTCTAGGAGAACAA 
AATCGTCAAGCAGCGAGTGCTTTTAAGAATGTTGCGTCTGGTTTGACTCAG 
ATTGAATCGAAAGAAAAAATGCTTACACCCAATTAG 

25 

(SEQ ID NO: 28) 

MSFMQRKSYLKSMSVLTLTACLISGYVVKDIAMLHAVSASEKKANNVSPREN 
LYRAVNDNWLANTKLKQGQTSVNSFSEIEDKLKQLLVSDMAKMASGKIETTN 
DEQKKMVAYYKQGMDFKTRDKNGLKPLKPVLQKLEAVSSMKDFQSLAHDF 
30 VMSGFVLPFGLTVETNARDNSQKQLVLRQAPALLESPDQYKKGNKEGEAKLS 
AYRTSAMALLKQAGKSNIEDRKLVKQAIAFDRLLSEKTQVDQSKrrAESETAA 
GRYNPESMETVHNYAKEFDFKELIEKLVGPTNKAVNVEDKTYFKQVNDVINS 
KQLANMKAWMMISMLVDQSDFLGEQNRQAASAFKNVASGLTQIESKEKMLT 
PN* 

35 

Sequence description: 

A] Length: 1095 bp - 365 aa (full length gene) - 

40 B] an GTG (possible ATG start codon located 7 bp 

further downstream) start codon with an obvious 
signal peptide. Shine Dalgarno sequence present 
upstream of the ORF. 



ID-90 



Clone RS-59 

5 

(SEQ ID NO: 29) 

ATGGAAATGCCTAAAAGAAATGAATTACTCAATAAAGAAATTAAAATGAG 
TATTGATAAACTTAGATATAAAGAACCAGAGAGTGAACATGACAAGCGAC 
CTACTTTTTATTTGGTAGTACTTATACTTGTTACTGTAGCAGTTATATTGTC 
10 GTTATTTAAATATTTTTTATAG 



(SEQ ID NO: 30) 

MEMPKRNELLNKEIKMSIDKLRYKEPESEHDKRPTFYLVVLILVTVAVILSLFK 
YFL* 



Sequence description: 

A] Length: 174 bp - 58 aa(full length gene) 
20 B] No obvious signal peptide, but Shine 

Dalgarno sequence is present upstream of ATG 
start codon. 



25 

ID-91 

Clone RS-62 (partial sequence) 

30 (SEQ ID NO: 31) 

ATGCAGGTATTTTTAAATATTGTCAATAAATTCTTTGATCCAGTTATTCATA 
TGGGTTCGGGAGTTGTGATGCTAATTGTCATGACAGGTTTAGCCATGATAT 
TTGGAGTGAAGTTTTCTAAAGCACTTGAAGGTGGTAT 

35 (SEQ ID NO: 32) 

MQVFLNIVNKFFDPVIHMGSGVVMLIVMTGLAMIFGVKFSKALEGG 



Sequence description: 

40 

A] Length: 141 bp - 41 aa (partial sequence 

B] Shine Dalgarno sequence present upstream of 
ATG start codon with a possible signal peptide 
present 




5 Clone RS-69 (partial sequence) 
(SEQ ID NO: 33) 

ATGAAAAAGAAAACATTCAGTGCTTATAACTTTTTAACGGCTCTTATCCTT 
TGTCTTTTGACAGTGCTTTTTATCTTTCCATTTTATTGGATTATGACAGGAG 
10 CTTTTAA 

(SEQ ID NO: 34) 

MKKKTFSAYNFLTALILCLLTVLFIFPFYWIMTGAF 

15 

Sequence description: 

A] Length: 1 10 bp -36 aa (Partial sequence) 

B] Possible signal peptide with Shine Daigarno 
20 sequence directly upstream of the ATG start 

codon. 



25 ID-93 

Clone RS-70 
(SEQ ID NO: 35) 

30 ATGACTGAGAACTGGTTACATACTAAAGATGGTTCAGATATTTATTATCGT 
GTCGTTGGTCAAGGTCAACCGATTGTTTTTTTACATGGCAATAGCTTAAGT 
AGTCGCTATTTTGATAAGCAAATAGCATATTTTTCTAAGTATTACCAAGTT 
ATTGTTATGGATAGTAGAGGGCATGGCAAAAGTCATGCAAAGCTAAATAC 
CATTAGTTTCAGGCAAATAGCAGTTGACTTAAAGGATATCTTAGTTCATTT 

3 5 AG AG ATTG AT AAAGTT AT ATTGGT AGGCC AT AGCG ATGGTGCT A ATTT AGC 
TTTAGTTTTTCAAACGATGTTTCCAGATATGGTTAGAGGGCTTTTGCTTAAT 
TCAGGGAACCTGACTATTCATGGTCAGCGATGGTGGGATATTCTTTTAGTA 
AGGATTGCCTATAAATTCCTTCACTATTTAGGGAAACTCTTTCCGTATATG 
AGGCAAAAAGCTCAAGTTATTTCGCTTATGTTGGAGGATTTGAAGATTAGT 

40 CCAGCTGATTTACAGCATGTGTCAACTCCTGTAATGGTTTTGGTTGGAAAT 
AAGGACATAATTAAGTTAAATCATTCTAAGAAACTTGCTTCTTATTTTCCA 
AGGGGGGAGTTTTATTCTTTAGTTGGCTTTGGGCATCACATTATTAAGCAA 
GATTCCCATGTTTTTAATATTATTGCAAAAAAGTTTATCAACGATACGTTG 
AAAGGAGAAATTGTTGAAAAAGCTAATTGA 



(SEQ ID NO: 36) 

MTENWLHTKDGSDIYYRVVGQGQPrVFLHGNSLSSRYFDKQIAYFSKYYQVIV 
MDSRGHGKSHAKLNTISFRQIAVDLKDILVHLEIDKVILVGHSDGANLALVFQ 
5 TMFPDMVRGLLLNSGNLTIHGQRWWDILLVRIAYKFLHYLGKLFPYMRQKA 
QVISLMLEDLKISPADLQHVSTPVMVLVGNKDIIKLNHSKKLASYFPRGEFYSL 
VGFGHHIIKQDSHVFNIIAKKFINDTLKGErVEKAN* 



10 Sequence description: 

A] Length: 744 bp - 248 aa (full length gene) 

B] No obvious signal peptide, but Shine 
Dalgarno sequence upstream of the ATG start 

15 codon. 



ID-94 
20 Clone RS-71 

(SEQ ID NO: 37) 

ATGGTAGCAAAAGAGTTAGGTAAAAATAGCTTTACTATCCCAACTATTTGT 
TCTAATTGCTCCGCAGGTACTGCCATTGCAGTTGTATATAATGATGACCAT 

25 TCTTTCTTAAGATACGGCTATCCCGAGTCTCCACTTCATATTTTTATCAATA 
CACGGATCATTGCACAGGCACCAAGCAAATATTTTTGGGCTGGTATTGGGG 
ACGGTATTTCAAAAGCCCCTGAAGTAGAACGTGCTACCTTAGAGGCTAAG 
ACCAATAAACTACCACATACTGCAGTGTTAGGACAAGCAGTCGCTCTGTCT 
TCAAAGGAAGCTTTTTATCAATTTGGTGAACAAGGTCTAAAAGACGTTGAA 

30 GCTAATTTAGCTTCGCGTGCAGTTGAAGAAATTGCGCTTGATATCTTA 

(SEQ ID NO: 38) 

MVAKELGKNSFTIPTICSNCSAGTAIAVVYNDDHSFLRYGYPESPLHIFrNTRIIA 
QAPSKYFWAGIGDGISKAPEVERATLEAKTNKLPHTAVLGQAVALSSKEAFY 
35 QFGEQGLKDVEANLASRAVEEIALDIL 



Sequence description: 

40 A] Length: 405 bp - 135 aa (Partial sequence) 

B] No obvious Shine Dalgarno sequence upstream 
of the ATG start codon, probable signal 
peptide present at the N-terminus. 



ID-95 



Clone RS-73 

5 (SEQ ID NO: 39) 

TTGAGGGAAACTTACTGGAAAATTTCAAGCGATTGCGATAAAATAAATCTT 
GCAGAGTTTTCTAGAGAAAGGAGGTCAGATTTATTGGAGTGGCAAGATCT 
AGCGCAGTTACCTGTATCTATTTTTAAAGACTATGTTACAGATGCTCAAGA 
CGCGGAAAAACCTTTTATATGGACAGAAGTATTTTTAAGGGAGATTAATCG 

1 0 CTC A A ATC AAGA AATTATTTTGC ATATTTGGCCGATGACT AAG AC AGTC AT 
TCTGGGGATGTTAGATCGAGAATTACCACATTTAGAATTAGCTAAAAAAG 
AAATCATCAGTCGTGGTTATGAACCAGTTGTTCGGAATTTTGGAGGTCTCG 
CAGTTGTAGCTGATGAAGGAATTTTAAATTTTTCATTGGTTATTCCAGATGT 
TTTTGAGAGAAAATTGTCTATCTCAGATGGGTATCTTATAATGGTCGATTTT 

1 5 ATTAGAAGTATATTTTCGGATTTTTATCAACCTATTGAGCACTTTG AAGTA 
GAGACCTCCTATTGTCCTGGTAAGTTTGATCTTAGTATAAATGGCAAAAAA 
TTTGCTGGCTTGGCTCAGCGCCGTATAAAGAATGGTATTGCGGTATCAATT 
TACCTTAGCGTTTGTGGCGATCAAAAAGGGCGGAGTCAAATGATTTCAGAT 
TTTTATAAGATTGGTCTAGGTGATACGGGTAGTCCAATTGCTTATCCAAAT 

20 GTAGATCCTGAAATTATGGCTAATCTATCTGATCTATTAGATTGTCCTATG 
ACAGTAGAAGATGTTATTGATCGTATGTTGATTAGCCTTAAACAAGTAGGT 
TTTAATGATCGTTTACTGATGATTAGACCCGATTTAGTTGCAGAGTTTGAT 
AGATTTCAGGCTAAGTCTATGGCTAATAAGGGGATGGTGAGCAGAGATGA 
ATAA 

25 

(SEQ ID NO: 40) 

MRETYWKISSDCDKINLAEFSRERRSDLLEWQDLAQLPVSIFKDYVTDAQDAE 
KPFIWTEVFLREINRSNQEIILHIWPMTKTVILGMLDRELPHLELAKKEnSRGYE 
PVVRNFGGLAVVADEGILNFSLVIPDVFERKLSISDGYLIMVDFIRSIFSDFYQPI 
30 EHFEVETSYCPGKFDLSINGKKFAGLAQRRIKNGIAVSIYLSVCGDQKGRSQMI 
SDFYKIGLGDTGSPIAYPNVDPEIMANLSDLLDCPMTVEDVIDRMLISLKQVGF 
NDRLLMIRPDLVAEFDRFQAKSMANKGMVSRDE* 



35 Sequence description: 

A] Length: 921 bp -307 aa (Full-length gene sequence) 

B] No obvious Shine Dalgarno sequence upstream 
of the TTG start codon or signal peptide 

40 visible. Actual start point may be a further 

85 bp downstream (TTG). This start point is 
preceded by a typical Shine-Dalgarno sequence. 



ID-96 

Clone RS-74 

5 

(SEQIDNO: 41) 

TTGGAAGGTTTACTTATTGCATTGATTCCCATGTTTGCGTGGGAAAGTATT 
GGATTTGTTAGTAATAAAATTGGAGGGCGTCCAAATCAACAAACATTTGG 
AATGACTTTAGGAGCATTGCTATTTGCGATTATCGTATGGTTATTTAAACA 

1 0 GCCAGAG ATG ACTGCCTC ATTGTGG ATTTTTGGTATCTTAGGTGGTATCCT 
ATGGTCAGTCGGCCAAAATGGTCAATTTCAAGCAATGAAATATATGGGAG 
TCTCTGTTGCTAATCCACTGTCAAGTGGTGCACAATTAGTAGGTGGAAGCC 
TAGTTGGTGCTTTAGTCTTTCATGAATGGACTAAGCCAATCCAATTTATTTT 
AGGATTGACAGCGTTGACATTATTAGTTATCGGCTTCTATTTCTCAAGTAA 

1 5 ACGTGATGTTTC AG A AC AAGCTTTGGC A ACAC ATC A AG AGTTTTCA AAAG 
GATTTGCTACAATTGCTTATTCAACTGTAGGTTACATCTCGTACGCAGTTTT 
ATTTAACAACATTATGAAGTTCGACGCTATGGCCGTCATTTTACCCATGGC 
TGTTGGAATGTGTCTAGGTGCAATTTGTTTCATGAAGTTTCGTGTTAACTTT 
GAGGCTGTTGTTGTTAAAAATATGATTACAGGTCTCATGTGGGGCGTTGGT 

20 AATGTCTTCATGTTATTGGCAGCAGCTAAAGCAGGGCTAGCAATTGCTTTT 
AGTTTTTCTCAACTTGGAGTAATTATCTCTATTATTGGTGGTATTTTATTTTT 
AGGTGAGACAAAAACGAAGAAAGAGCAGAAATGGGTTGTCATGGGTATC 
CTTTGTTTTGTTATGGGTGCTATATTACTTGGTATTGTTAAATCTTATTAA 

25 (SEQ ID NO: 42) 

MEGLLIALIPMFAWESIGFVSNKIGGRPNQQTFGMTLGALLFAIIVWLFKQPEM 
T ASLWIFGILGGILWS VGQNGQFQ AM KYMGVS VANPLS SGAQLVGGS L VG AL 
VFHEWTKPIQFILGLTALTLLVIGFYFSSKRDVSEQALATHQEFSKGFATIAYST 
VGYISYAVLFNNIMKFDAMAVILPMAVGMCLGAICFMKFRVNFEAVVVKNMI 

30 TGLMWGVGNVFMLLAAAKAGLAIAFSFSQLGVIISIIGGILFLGETKTKKEQK 
WVVMGILCFVMGAILLGIVKSY* 



Sequence description: 

35 

A] Length: 867 bp - 289 aa (full-length gene) 

B] Posible Shine Dalgarno sequence upstream of 
GTG start codon, no obvious signal peptide 

present. 

40 

ID-97 

Clone RS-75 



(SEQ ID NO: 43) 

ATGACAACTTACTACGAAGCTATAAACTGGAACGAAATTGAAGATGTTAT 
TGATAAATCAACTTGGGAAAAACTAACCGAACAATTTTGGCTCGATACAC 
5 GTATCCCTTTATCAAATGACTTAGACGATTGGCGCAAACTTTCCGCTCAAG 
AAAAAGATCTTGTTGGCAAGGTTTTTGGAGGCTTAACCCTACTTGATACCA 
TGCAATCAGAAACTGGTGTTGAAGCTATTCGTGCCGATGTTCGCACGCCTC 
ACGAAGAAGCTGTCTTAAACAATATTCAATTCATGGAATCTGTTCACGCTA 
AATCTTATTCTTCAATTTTCTCAACTTTAAATACTAAATCAGAAATTGAAG 

1 0 A AATTTTCGAGTGG ACTAAT AATAATG AGTTCCTTC A AG AAA AAGC ACGT 
ATTATCAATGACATTTATGCTAATGGAAATGCCCTTCAAAAAAAGGTGGCT 
TCCACCTACCTCGAAACTTTCCTTTTTTATTCTGGCTTTTTCACACCTCTTTA 
CTATTTGGGAAATAATAAGTTAGCAAATGTTGCTGAAATCATTAAATTAAT 
TATTCGTGATGAATCTGTACATGGTACTTATATCGGTTACAAATTCCAGCTT 

1 5 GGTTTTAACGA ATTACCAG A AG ATGAGC A AG AGAATTTTCGTGATTGGAT 
GTATGACCTCCTTTATCAGCTGTATGAAAACGAAGAAAAATACACCAAGA 
CACTTTATGATGGCGTAGGATGGACTGAAGAAGTTATGACCTTTTTACGCT 
ACAATGCTAATAAAGCTCTTATGAATTTAGGACAAGATCCTTTATTCCCAG 
ATACAGCAAATGATGTCAACCCAATTGTTATGAATGGTATTTCAACAGGAA 

20 CATCAAACCATGACTTCTTCTCTCAAGTAGGTAATGGTTACCTACTTGGTA 
GCGTTGAAGCTATGCATGATGATGACTATAACTATGGATTATAA 

(SEQ ID NO: 44) 

MTTYYEAINWNEIEDVIDKSTWEKLTEQFWLDTRIPLSNDLDDWRKLSAQEK 
25 DLVGKVFGGLTLLDTMQSETGVEAIRADVRTPHEEAVLNNIQFMESVHAKSY 
SSIFSTLNTKSEIEEIFEWTNNNEFLQEKARIINDIYANGNALQKKVASTYLETF 
LFYSGFFTPLYYLGNNKLANVAEIIKLIIRDESVHGTYIGYKFQLGFNELPEDEQ 
ENFRDWMYDLLYQLYENEEKYTKTLYDGVGWTEEVMTFLRYNANKALMNL 
GQDPLFPDTANDVNPIVMNGISTGTSNHDFFSQVGNGYLLGSVEAMHDDDYN 
30 YGL* 



Sequence description: 

35 A] Length: 960 bp - 320 aa (full length gene) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, but no signal peptide 
present. 

40 

ID-98 



Clone RS-77 (partial sequence) 



(SEQ ID NO: 45) 

ATGAATTGGTCACGTATCTGGGAACTCGTAAAAATTAATATCCTTTATTCA 
AACCCTCAGACTCTATCGGCACTAAGAAAAAAGCAAGAAAAGCATCCTAA 
AAAAGAATTTTCAGCTTATAAATCCATGTTTAGAAATCAGTTATTTCAGAT 
5 TTTGCTCTTTTCAATAATTTATGTATTTCTCTTTGTATCACTTGATTTTAAAG 
AATATCCGGGCTATTTCACGTTCTACATTGGTATCTTTACACTAGTATCCAT 
TATCTACTCTTTTATTGCGATGTACAGTGTTTTCTATGAGAGTGACGATGTT 
AA 

10 (SEQ ID NO: 46) 

MNWSRIWELVKINILYSNPQTLSALRKKQEKHPKKEFSAYKSMFRNQLFQILL 
FSnYVFLFVSLDFKEYPGYFTFYIGIFTLVSIIYSFIAMYSVFYESDDV 



15 Sequence description: 

A] Length: 31 1 bp - 103 aa (Partial sequence) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, no obvious signal peptide at 

20 N-terminus. 



ID-99 

25 Clone RS-78 (partial sequence) 
(SEQ ID NO: 47) 

TAATCTTTTAGTCAACGGAGCAACAGGAAAATTGCAGGCTATGCGACAGA 
TATTCCACCACATAATTTAGCAGAAGTCATTGATGCTGTCGTGTACATGAT 
30 TGATCACCCTAAAGCTAAATTAGATAAATTAATGGAATTTCTACCTGGTCC 
AGATTTTCCAACTGGCGCTATCATTCAAGGAAAAGATGAAATTCGTAAGG 
CATATGAGACTGGTAAGGGGAGAGTAGCGGTTCGCTCGCGAACTGCTATT 
GAAACCTTAAAAGGTGGTAAGAAACAAATTATTGTTACTGAAATTCCTTAT 
GAAGTTAAT 

35 

(SEQ ID NO: 48) 

SFSQRSNRKIAGYATDIPPHNLAEVIDAVVYMIDHPKAKLDKLMEFLPGPDFPT 
GAIIQGKDEIRKAYETGKGRVAVRSRTAIETLKGGKKQnVTEIPYEVN 

40 

Sequence description: 

A] Length: 312 bp - 104 aa (Partial sequence) 

B] No obvious Shine Dalgarno sequence or a 



signal peptide. Both N- and C- termini of ORF 
yet to be elucidated. 

5 ID- 100 

Clone RS-79 
(SEQ ID NO: 49) 

1 0 ATGGG ACGTA AGTGGGCC AATATTGTTGCC A AAA AG ACTGCTA A AG ATGG 
TGCTAACTCAAAAGTATACGCTAAATTCGGTGTTGAAATATATGTTGCTGC 
AAAGCAAGGTGAACCAGACCCCGAGTCAAACTCAGCTCTAAAATTCGTTT 
TGGACCGTGCTAAGCAAGCACAAGTTCCAAAGCATGTTATTGATAAAGCG 
ATTGATAAAGCCAAAGGAAACACAGATGAAACTTTCGTAGAGGGACGCTA 

1 5 TGA AGGTTTTGGTCC AA ATGGTTC A ATGATTATTGTGG AT ACTTTGAC ATC 
AAATGTTAACCGTACGGCAGCAAATGTACGTACTGCTTACGGTAAGAACG 
GTGGCAATATGGGAGCTTCAGGATCGGTATCCTACTTATTTGATAAAAAAG 
GTGTCATCGTTTTTGCTGGTGATGATGCTGACACTGTCTTCGAACAATTACT 
TGAAGCGGATGTAGACGTAGATGATGTTGAAGCAGAAGAGGGAACAATA 

20 ACAGTTTATACCGCCCCAACAGATCTTCATAAAGGTATCCAAGCACTTCGC 
GATAATGGTGTAGAAGAATTCCAAGTTACTGAACTTGAAATGATTCCTCAA 
TCAGAAGTAGTATTGGAAGGTGATGACCTTGAAACTTTTGAAAAGCTT 

(SEQ ID NO: 50) 

25 MGRKWANIVAKKTAKDGANSKVYAKFGVEIYVAAKQGEPDPESNSALKFVL 
DRAKQAQVPKHVIDKAIDKAKGNTDETFVEGRYEGFGPNGSMirVDTLTSNV 
NRTAANVRTAYGKNGGNMGASGSVSYLFDKKGVIVFAGDDADTVFEQLLEA 
DVDVDDVEAEEGTITVYTAPTDLHKGIQALRDNGVEEFQVTELEMIPQSEVVL 
EGDDLETFEKL 

30 

Sequence description: 

A] Length: 654 bp - 218 aa (Partial sequence) 
35 B] Possible Shine Dalgarno sequence upstream 

of ATG start, no obvious signal peptide 



40 ID-101 



Clone RS-80 



(SEQIDNO: 51) 

TTGGAGAAATATTTGAAGAACCCGATTACATGGATTGGATTAGTTCTTGTG 
GTTACGTGGTTTTTAACTAAAAGTAGTGAATTTTTGATTTTTGGTGTGTGTG 
TCTTGTTGTTAGTATTTGCTAGTCAAAGTGAT 

(SEQ ID NO: 52) 

MEKYLKNPITWIGLVLVVTWFLTKSSEFLIFGVCVLLLVFASQSD 



Sequence description: 

A] Length: 1 35 bp - 45 aa (partial sequence) 

B] Shine Dalgarno sequence upstream of TTG 
start codon with possible signal peptide 
evident at N-terminus. 



ID- 102 
Clone RS-81 
(SEQ ID NO: 53) 

ATGACACAATCAGATGCATATCTCTCGTTGAACGCGAAGACACGCTTTAGA 

GATCGCACAGGTAATTATCATTTTACTTCGGATAAAGAGGCTGTTGAACAA 

TATATGATAGAACATGTTGAACCTAATACGATGGTGTTCACATCACTAATT 

GAAAAGCTAGATTATTTGGTTTCTAATAACTACTATGAATCGGACCTTCTA 

AAACAATATAACCTTGAGTTTATTTGCCAAATTTTTGAGCATGCATAGGCT 

AAGAAATTTGCTTTTCTAAATTTTATGGGGGCTTTAAAATTTTATAATGCTT 

ATGCTCTTAAT 

(SEQ ID NO: 54) 

MTQSDAYLSLNAKTRFRDRTGNYHFTSDKEAVEQYMIEHVEPNTMVFTSLIE 

KLDYLVSNNYYESDLLKQYNLEFICQIFEHAYAKKFAFLNFMGALKFYNAYA 

LN 



Sequence description: 



A] Length: 318 bp - 106 aa (Partial sequence) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, no obvious signal peptide 



ID- 103 



Clone 2-11 A 

5 (SEQ ID NO: 55) 

ATGGTATTTATGGCAAATAAGAAAAAAACAAAAGGAAAGAAAACCAGAA 
GACCTACTAAGGCAGAAATAGAGCGTCAAAGAGCTATTCAAAGGATGATT 
ACTGCTCTTGTTTTAACAATTATTCTCTTCTTTGGTATTATCAGATTAGGTA 
TTTTTGGTATTACAGTCTATAACGTCATCCGTTTTATGGTAGGTAGCTTGGC 

1 0 TT ACTTATTT ATTGCGGC AACTTTAATCTACCTTT ATTTCTTTA A ATGGTTG 
CGAAAGAAAGATAGCTTAGTAGCAGGTTTTTTGATAGCTTCTTTAGGATTA 
TTGATTGAGTGGCATGCTTACCTTTTCTCAATGCCTATTTTGAAAGATAAA 
GAAATTTTGCGTTCAACTGCTCGATTAATTGTGTCTGATTTAATGCAATTTA 
AAATCACTGTTTTTGCCGGTGGAGGTATGTTGGGTGCTTTGATTTACAAGC 

1 5 C AATTGCTTTTCTCTTTTCT AATATTGGTGCCT AT ATGATTGGTGTTCTCTTC 
ATCATTTTGGGTCTCTTTTTAATGAGTTCTCTGGAAGTTTATGACATCGTCG 
AATTTATTAGAGCTTTTAAAAATAAAGTGGCAGAGAAGCACGAGCAAAAT 
AAAAAGGAGCGTTTTGCTAAGCGAGAGATGAAAAAAGCAATCGCTGAACA 
AGAGCGCATAGAGCGTCAAAAAGCTGAAGAAGAAGCTTATTTAGCTTCGG 

20 TTAATGTAGACCCTGAAACGGGTGAGATTCTAGAGGATCAAGCTGAGGAC 
AATTTGGATGATGCGCTACCACCTGAGGTAAGTGAAACATCAACTCCGGT 
ATTTGAGCCAGAGATCCTTGCTTATGAGACATCGCCTCAAAATGATCCTTT 
ACCAGTAGAGCCGACAATTTATTTAGAAGACTATGATTCGCCGATTCCTAA 
TATGAGAGAAAATGATGAGGAAATGGTTTATGATTTAGATGATGATGTAG 

25 ATGATAGTGATATAGAAAATGTCGACTTTACACCTAAAACGACACTGGTTT 
ATAAATTACCAACGATAGATTTATTTGCACCAGATAAGCCTAAAAATCAAT 
CCAAAGAAAAGGATTTAGTCCGAAAGAATATCAGAGTTTTAGAAGAAACA 
TTTAGAAGTTTTGGTATCGATGTAAAAGTAGAACGTGCTGAAATTGGACCA 
TCAGTTACTAAATATGAAATTAAACCAGCAGTTGGAGTTCGTGTGAATCGT 

30 ATTTCAAATCTATCTGACGACCTAGCTCTTGCTCTTGCAGCAAAAGATGTG 
CGTATAGAAGCACCAATTCCTGGAAAATCATTAATAGGTATTGAAGTTCCT 
AACTCAGAAATTGCAACGGTTTCTTTCCGCGAACTTTGGGAACAATCTGAT 
GCCAATCCTGAAAACCTTTTAGAAGTACCACTAGGAAAAGCTGTTAACGG 
CAATGCTCGCAGTTTTAACTTAGCTAGAATGCCGCATCTTTTGGTAGCTGG 

35 TTCAACTGGTTCAGGTAAATCTGTGGCAGTTAATGGAATTATTTCAAGTAT 
TTTGATGAAGGCACGTCCAGATCAAGTTAAGTTTATGATGATTGATCCCAA 
AATGGTTGAATTATCTGTTTATAATGATATTCCACATTTATTAATCCCTGTT 
GTAACCAATCCGCGTAAAGCAAGTAAGGCACTCCAAAAAGTTGTTGATGA 
AATGGAAAATCGATACGAGTTATTTAGCAAAATTGGTGTGGGTAATATAG 

40 CAGGTTATAATACAAAGGTTGAAGAGTTTAATGCTTCCTCTGAGCAAAAAC 
AAATGCCTTTGCCTTTAATCGTTGTCATTGTAGATGAATTGGCTGACTTGAT 
GATGGTTGCTAGTAAAGAAGTTGAAGATGCTATTATTCGTTTGGGGCAAAA 
AGCACGTGCTGCAGGTATCCATATGATTCTTGCAACTCAACGTCCATCCGT 
AGATGTTATTTCTGGTTTGATTAAAGCAAATGTTCCGTCGCGTATTGCATTT 



GCTGTTTCAAGTGGTACTGATAGCCGTACGATCCTTGATGAAAATGGTGCT 
GAAAAGCTCTTGGGACGGGGTGACATGCTCTTTAAGCCTATTGATGAGAAT 
CATCCAGTACGACTACAAGGTTCCTTTATTTCAGATGATGATGTTGAAAGG 
ATCGTTGGTTTTATCAAAGACCAAGCCGAGGCTGACTATGATGATGCCTTT 
5 GATCCTGGAGAAGTATCTGAAACAGATAACGGCTCTGGTGGTGGCGGCGG 
AGTACCTGAAAGTGATCCTCTTTTTGAAGAAGCCAAGGGACTCGTTTTAGA 
GACGCAAAAAGCAAGTGCCTCAATGATTCAACGCCGATTGTCTGTTGGTTT 
CAATAGAGCAACAAGACTAATGGAAGAATTAGAAGCAGCGGGGGTTATTG 
GTCCAGCAGAAGGAACCAAGCCACGAAAAGTTTTAATGACTCCAACTCCG 
10 AGTGAATAA 

(SEQ ID NO: 56) 

MVFMANKKKTKGKKTRRPTKAEffiRQRAIQRMITALVLTIILFFGnRLGIFGIT 
VYNVIRFMVGSLAYLFIAATLIYLYFFKWLRKKDSLVAGFLIASLGLLIEWHA 

1 5 YLFSMPILKDKEILRSTARLIVSDLMQFKITVFAGGGMLGALIYKPIAFLFSNIG 
AYMIGVLFIILGLFLMSSLEVYDIVEFIRAFKNKVAEKHEQNKKERFAKREMK 
KAIAEQEREERQKAEEEAYLASVNVDPETGEELEDQAEDNLDDALPPEVSETST 
PVFEPEILAYETSPQNDPLPVEPTIYLEDYDSPIPNMRENDEEMVYDLDDDVDD 
SDENVDFTPKTTLVYKLPTIDLFAPDKPKNQSKEKDLVRKNIRVLEETFRSFGI 

20 DVKVERAEIGPSVTKYEIKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGK 
SLIGffiVPNSEIATVSFRELWEQSDANPENLLEVPLGKAVNGNARSFNLARMPH 
LLVAGSTGSGKSVAVNGIISSILMKARPDQVKFMMIDPKMVELSVYNDIPHLLI 
PVVTNPRKASKALQKVVDEMENRYELFSKIGVRNIAGYNTKVEEFNASSEQK 
QMPLPLIVVrVDELADLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVD 

25 VISGLIKANVPSRIAFAVSSGTDSRTILDENGAEKLLGRGDMLFKPIDENHPVRL 
QGSFISDDDVERIVGFIKDQAEADYDDAFDPGEVSETDNGSGGGGGVPESDPL 
FEEAKGLVLETQKASASMIQRRLSVGFNRATRLMEELEAAGVIGPAEGTKPRK 
VLMTPTPSE* 

30 

Sequence description: 

A] Length: 2451 bp - 817 aa (Full-length gene) 

B] Shine Dalgarno sequence present upstream of 
35 ATG start codon, possesses a potential signal 

peptide 

ID- 104 

40 

Clone 2-18/22b 
(SEQ ID NO: 57) 



ATGTCACAAGAGCAAGGAAAAATTTATATTGTAGAAGATGATATGACGAT 

TGTGTCACTTTTAAAAGATCATTTATCAGCTAGCTATCATGTCTCTAGTGTC 

AGCAATTTTCGTGATGTGAAACAAGAAATTATCGCATTTCAACCCGATTTG 

ATACTAATGGATATTACGTTACCCTATTTTAATGGTTTTTACTGGACTGCAG 

AATTGCGTAAGTTTTTAACAATTCCTATTATTTTCATTTCATCTAGTAATGA 

TGAAATGGATATGGTTATGGCATTAAATATGGGGGGTGATGACTTTATTTC 

AAAACCATTCTCTCTAGCTGTATTAGATGCTAAGCTAACTGCTATTTTAAG 

GAGAAGTCAACAATTTATCCAACAGGAATTAACTTTTGGGGGATTTACGTT 

GACAAGAGAAGGGTTATTGTCTAGCCAAGATAAAGAGGTTATTTTATCGC 

CAACAGAAAATAAAATCCTATCTATCTTGCTCATGCATCCTAAACAAGTAG 

TCTCAAAAGAGTCTCTATTAGAGAAACTTTGGGAAAATGATAGTTTTATTG 

ATCAAAATACACTTAATGTTAATATGACACGCTTACGTAAAAAAATTGTCC 

CAATAGGTTTTGATTACATTCATACAGTGAGAGGAGTTGGGTATTTACTAC 

AATGA 

(SEQ ID NO: 58) 

MSQEQGKIYIVEDDMTIVSLLKDHLSASYHVSSVSNFRDVKQEIIAFQPDLILM 

DITLPYFNGFYWTAELRKFLTIPIIFISSSNDEMDMVMALNMGGDDFISKPFSLA 

VLDAKLTAILRRSQQFIQQELTFGGFTLTREGLLSSQDKEVILSPTENKILSILLM 

HPKQVVSKESLLEKLWENDSFIDQNTLNVNMTRLRKKIVPIGFDYIHTVRGVG 

YLLQ* 



Sequence description: 

A] Length: 669 bp - 223 aa (full-length gene 
sequence) 

B] Shine Dalgarno sequence present upstream of a GTG start codon. 
Was not identified directly by LEEP. This gene was found upstream of 
gene ID- 10 described in WO 00/06736. 



ID- 105 
Clone 2-20 
(SEQ ID NO: 59) 

ATGTATCAAACTCAGACAAATAAGGAAAAATTTGTTTTATTTTTGAAATTA 

TTTATGCCAGTATTGATTTATCAATTTGCTAATTTTTGAGGTAGTTTTATTGA 

TTCGGTTATGACTGGACAGTATAGTCAGCTACATTTGGCAGGTGTGTCAAC 

TGCTAGTAATTTATGGACTCCGTTTTTCGCTTTATTAGTAGGTATGATTTCA 

GCATTAGTACCAGTAGTTGGTCAACATTTGGGTAGAGGAAATAAAGAACA 

AATTCGCACAGAATTTCATCAATTTCTATATTTAGGTTTGATACTGTCCTTA 

ATATTATTTTTAATCATGCAATTTATTGCTCAACCTGTCTTGGGGAGTTTGG 



GTTTAGAAGATGAAGTTCTAGCAGTTGGTCGTGGTTATTTAAATTATATGT 
TGATTGGAATCATGCCGCTGGTGTTGTTTAGCATTTGCCGTTCATTCTTTGA 
TGCATTGGGGTTAACAAGGTTATCTATGTATCTGATGCTTTTAATTCTACCC 
TTTAATTCATTTTTTAATTATATGCTTATCTACGGTAAATTTGGTATGCCTA 
5 GACTAGGAGGTGCGGGGGCAGGTCTTGGAACTTCTTTAACTTATTGGGCTA 
TTTTTATTGGTATTATTATTGTGATGTCACTTCATCCTCAAATTAAAACATA 
TCATATATGGACTCTGGAAAGAATAAAAGCTCCTTTGATTATTGAAGATAT 
TCGATTGGGATTACCGATTGGTTTACAAATTTTTGCAGAAGTTGCAATTTTT 
GCAGTAGTAGGCTTATTCATGGCAAAATTTTCTTCAATCATTATTGCAGCA 

1 0 C ATC AGGCTGCTATG A ATTTTTC ATCATT A ATGT ATGC ATTTCCTTT AAGTA 
TTTCCACTGCTCTAGCTATTACAATATCGTTTGAAGTAGGGGCAGAGCGCT 
TTCAGGACGCAACCACTTATAGTAGGATAGGACGCTTAACAGCGGTAGGG 
ATTACATCAGGAACCTTACTATTTTTATTTCTATTTCGTGAGAATGTAGCAG 
CAATGTATAATAGTGCCCCTCACTTTGTCGCTATTACAGCTCAATTCCTAAC 

1 5 TTATAGTCTCTTTTTCC AGTTTGC AGATGCTTATGC AGCTCCTGTAC AGGGG 
ATTTTACGAGGCTATAAGGATACAACAAAACCATTTATGATCGGTGCGGG 
CTCTTATTGGTTATGTGCTTTGCCATTAGCGGTTATCTTAGAAAAAAATAG 
CCAGTTAGGTCCGTTTGCCTATTGGATTGGTTTAATCACAGGTATTTTTGTT 
TGTGGTCTATTTCTAAACCAACGTCTGCAAAAGATTAAGAAGTTGTATTAT 

20 TAA 

(SEQ ID NO: 60) 

MYQTQTNKEKFVLFLKLFIPVLIYQFANFSATFIDSVMTGQYSQLHLAGVSTAS 
NLWTPFFALLVGMISALVPVVGQHLGRGNKEQIRTEFHQFLYLGLILSLILFLI 

25 MQFIAQPVLGSLGLEDEVLAVGRGYLNYMLIGIMPLVLFSICRSFFDALGLTRL 
SMYLMLLILPFNSFFNYMLIYGKFGMPRLGGAGAGLGTSLTYWAIFIGIIIVMS 
LHPQIKTYHIWTLERIKAPLIffiDmLGLPIGLQIFAEVAIFAVVGLFMAKFSSIIIA 
AHQAAMNFSSLMYAFPLSISTALAITISFEVGAERFQDATTYSRIGRLTAVGITS 
GTLLFLFLFRENVAAMYNSAPHFVAITAQFLTYSLFFQFADAYAAPVQGILRG 

30 YKDTTKPFMIGAGSYWLCALPLAVILEKNSQLGPFAYWIGLITGIFVCGLFLNQ 
RLQKIKKLYY* 



Sequence description: 

35 

A] Length: 1341 bp - 447 aa (full length gene) 

B] Shine-Dalgarno sequence present upstream of 
ATG start codon, There is a potential signal 
peptide sequence 

40 



ID- 106 



Clone 2-4 A 



(SEQIDNO: 61) 

TTGCTAGTTTCTTCTCTAGTTTCTTGTTCATTTTTTCTTGTCATTTCGTCGTT 
5 GTCTTCATCAACACGAAATAAGTCTATAAACTTATCAAATAATTTCATAGA 
CTTATTATATCAATTTTCAATAAAATGCTATAATAAAACCATGTCATTTTCA 
TTAAAAATTAGAAATCCATACGGTGAACATACCGTTAAAGAACTCCTTGA 
AGATTATTTTTTGATTCCACGTAAGATTAGACATTTTTTGCGTGTTAAAAAA 
CATGTACTTATAAACAATGAATTCATTAATTGGCAAACTGTCGTCCAAGAA 

1 0 AACGATACT ATT ACCTTAATCTTTGATG ATGAGG ATT ACCCTACTA A AAA A 
ATTCCTCTGGGCAGAGCAGAGCTTATTGATTGTCTTTATGAGGATGAACAT 
CTTATTATCGTTAATAAACCTGAAGGTATGAAAACTCACGGTAACCAACCA 
AATGAAATAGCACTGTTAAATCATGTATCTGCCTATTCTGGACAAACATGC 
TATGTTGTTCATCGCCTAGATATGGAGACCAGTGGAGCTGTTTTATTTGCT 

1 5 AAAAATCC ATTTATACTTCCCCTTATCAATCAACGCTTAGAACGAAAAGAA 
ATTTGGCGTGAATATTGGGCTTTAGTTGAAGGAAAATTTTCACCTAAGCAT 
CAAGTTTTGAGAGACAAAATTGGACGGAACCGTCATGACAGACGTAAACG 
AATCATTGATTCTAAAAACGGTCAACATGCTATGACAATCATTGACGTTTT 
GAAGTATATCCAAAATAGTAGTCTCATAAAATGCCGACTGGAAACCGGAA 

20 GAACCCATCAAATTCGCATTCACTTATCTCATCACGGACATCCTTTAATAG 
GAGATCCCCTCTACAACCCTTCTTCTAATAATGAAAGGTTAATGCTACACG 
CTCACCGATTGACTCTATCCCATCCATTAACTTGCGAAACTATTAGCGTAG 
AGGCCCCTTCATCTACTTTCGAGAAGGTTTTAAACAATTATAAAAAAGGAG 
TTGGATAA 

25 

(SEQIDNO: 62) 

MLVSSLVSCSFFLVISSLSSSTRNKSINLSNNFIDLLYQFSIKCYNKTMSFSLKIR 
NPYGEHTVKELLEDYFLIPRKIRHFLRVKKHVLINNEFINWQTVVQENDTrrLIF 
DDEDYPTKKIPLGRAELIDCLYEDEHLUVNKPEGMKTHGNQPNEIALLNHVSA 
30 YSGQTCYVVHRLDMETSGAVLFAKNPFILPLINQRLERKEIWREYWALVEGKF 
SPKHQVLRDKIGRNRHDRRKRIIDSKNGQHAMTIIDVLKYIQNSSLIKCRLETG 
RTHQIRIHLSHHGHPLIGDPLYNPSSNNERLMLHAHRLTLSHPLTCETISVEAPS 
STFEKVLNNYKKGVG* 

35 

Sequence description: 

A] Length: 1029 bp - 343 aa (Full length gene sequence) 

B] No obvious Shine-Dalgarno sequence upstream 

40 of the putative TTG start codon. Possesses a 

potential leader peptide sequence. 



ID- 107 



Clone 2-54 
(SEQ ID NO: 63) 

GAACTAAATGCAACTCAACCTAATAATAGAACTACCTATATTATACCCGAA 

AGCAGTCATTCCATTGCAGAACAACAGAGATTCCTGATAGAATCAAAGGG 

TTCTTCGGTTGCATTACTTAATAGCGATGAATTTAGAAAGACAGCGGGAGA 

GGATAGAGGTTTTGAAAGGGATAAGTTGAGGTCTTTGGATATCATTCCTAA 

GGGAGATTTATCGACAAGTAATGTCATAGGTAATACGGACATTGCTAGTC 

AGATATCGTTGGGCTTTAAAAAGAATGCGATGCAGGAACACCATCTTACT 

AAAACATTCTCTCAAAAGGATGGAAAGTTATCGTCTGTTATAGAGGGGAT 

GCTTGCTATTGGCAAAGAGAAAGTAGAGAAAGAAATAAAATATAGTGGTA 

ATTTATGGCAAAAATTAAAAGCTAAGGCACACTGCCTTGTTTGCTGTGTTG 

ATAATTTGAATTTTGAAGATATAAAATCTTATTTTCAATATTATTGTCATCT 

AAACCATCAGCTCAAATTACCTAAAGGTGCTATACTTTCTGCTAAAACAGA 

AGTATATAGGGGAGGAGATTTTGGGAGAAAAAATAAAGATAATGTGTTTG 

GTTACCGTATCCCCTCATTATTGAAAACCCAAAAAGGAACTTTACTTGCGG 

GAGCTGATGAAAGAATTGAGCAAGCTTGTGATTGGGGAAACATAGGAATG 

GTTATTCGCCGTAGTGAGGATGATGGTGTCACTTGGGGAAAAAGAGAAAC 

TATTGTCAATCTCCGTAATAACCCTAGAGTTCCGCTAGTTACTAGTGGTGA 

CTATAGTGGCTCACCTATTAATATGGATATGGCATTAGTTCAAGATACTAG 

CTCCAAGACGAAACGTATTTTTTCAATATATGATATGTTTCCAGAAGGAAG 

AGGCGTTATTAGTATTGCTAACACACCTGAAAAAGAATATACCCAAATCG 

GAGGACAGTCTTATCTTAATTTATATAATAATGGAAAGAAATCGAAGGTTT 

TTACTATCCGTGACAAAGGTATTGTATATAATTTTAAAGGGAAAAAGACTG 

ATTATCATGTTATAACAGAAACTACTAAAAGTGACCATTCAAATCTAGGGG 

ATATTTATAAGGGAAAACAGCTACTTGGAAATATATATTTTACAAAACATA 

AAACGTCACCATTTCGTTTAGCAAAATCAAGCTATGTGTGGATGTCATATA 

GCGATGATGATGGTAGGACATGGTCATCACCTAGAGATATAACAGCAAGT 

CTTCGTCAGAAAGGCATGAAATTTTTGGGAATAGGACCTGGAAAAGGTAT 

AGTTTTAAAATGGGGGCCACACGCTGGTCGTATTATTATTCCTGCCTATTCT 

ACGAATTGGAAATCTCATCTAAGAGGTTCACAATCTTCACGCCTAATTTAT 

TCAGACGACCATGGAAAAACGTGGCATACTGGAAAAGCAGTTAATGATAA 

CCGTATACTTTCTAATGGTGAAAAAATTCACTCCTTAACAATGGATAATAA 

AAAAGAACAAAATACAGAATCCGTACCCGTTCAATTGAAAAATGGGGACA 

TTAAGTTATTTATGAGGAATCTAACTGGTAACCTAGAAGTAGCCACAAGTA 

AAGACGGCGGGGAGACTTGGCAAAACCATGTTAAACGATATAAGGAAATT 

CATGATGCTTACGTCCAACTATCAGCTATTCGCTTTGAGCATGACAAAAAA 

GAGTATATTTTATTAGTGAATGCTAATGGGCCAGGGAAGAAGTGCCAAGA 

TGGATATGCACGTCTAGCGCAAGTTAATCGAAATGGTAGTTTTAAGTGGTT 

ATATCACCATCACATTCAAGATGGTTCGTTTGCTTACAACTCTGTTCAACA 

ACTTAATAATGATCAATTTGGTGTCCTTTATGAACATAGAGAAAAACATCA 

AAATAGTTTTACTTTAAATTACAAAGTTTTTAATTGGAGTTTTCTTAGTCAA 



AATACAGAGAAGCAAGGCACTTTATGGGAGAAAATGGCAGCAAATTGGCA 
TGTTTTGTTTAAATTTTATTTATGA 

(SEQ ID NO: 64) 

ELNATQPNNRTTYIIPESSHSIAEQQRFLIESKGSSVALLNSDEFRKTAGEDRGF 

ERDKLRSLDIIPKGDLSTSNVIGNTDIASQISLGFKKNAMQEHHLTKTFSQKDG 

KLSSVIEGMLAIGKEKVEKEIKYSGNLWQKLKAKAHCLVCCVDNLNFEDIKS 

YFQYYCHLNHQLKLPKGAILSAKTEVYRGGDFGRKNKDNVFGYRIPSLLKTQ 

KGTLLAGADERIEQACDWGNIGMVIRRSEDDGVTWGKRETIVNLRNNPRVPL 

VTSGDYSGSPINMDMALVQDTSSKTKRIFSIYDMFPEGRGVISIANTPEKEYTQI 

GGQSYLNLYNNGKKSKVFTIRDKGIVYNFKGKKTDYHVITETTKSDHSNLGDI 

YKGKQLLGNI YFTKHKTSPFRLAKS S Y VWMS YSDDDGRTWS SPRDIT AS LRQ 

KGMKFLGIGPGKGIVLKWGPHAGRIIIPAYSTNWKSHLRGSQSSRLIYSDDHG 

KTWHTGKAVNDNRILSNGEKIHSLTMDNKKEQNTESVPVQLKNGDIKLFMRN 

LTGNLEVATSKDGGETWQNHVKRYKEIHDAYVQLSAIRFEHDKKEYELLVNA 

NGPGKKCQDGYARLAQVNRNGSFKWLYHHHIQDGSFAYNSVQQLNNDQFG 

VLYEHREKHQNSFTLNYKVFNWSFLSQNTEKQGTLWEKMAANWHVLFKEYL 



Sequence description: 

A] Length: 2052 bp - 684 aa (partial gene sequence) 

B] N-terminus has yet to be determined 

ID- 108 
Clone 2-61 
(SEQ ID NO: 65) 

ATGCCTAAATTAATCGTATCTTTCCTCTGCATTTTATTATCCCTGACTTGTG 

TAAACTCTGTGCAAGCTGAAGAACATAAAGATATTATGCAAATTACCCGA 

GAAGCCGGATATGATGTTAAAGATATTAATAAACCTAAAGCGTCTATCGTT 

ATTGACAATAAAGGTCATATTTTGTGGGAAGATAACGCCGATTTAGAACGT 

GATCCCGCTAGCATGTCTAAAATGTTTACTTTATATTTACTATTTGAAGACT 

TAGCTAAAGGAAAAACAAACCTCAACACCACAGTGACTGCAACAGAAACA 

GACCAAGCCATAAGTAAGATTTATGAAATTAGTAATAACAATATTCATGCT 

GGGGTTGCTTATCCTATTCGTGAACTGATTACTATGACGGCTGTCCCGTCA 

TCTAATGTAGCAAGTATTATGATTGGTAACCACTTATCACAAAACAATGGT 

GACGCCTTTATTAAACGAATCAATGAAACCGCCAAGAAACTCGGTATGAC 

AAAAACTCACTTTTATAACCCCAGTGGGGCGGTAGCGAGTGCTTTTAATGG 

ACTTTACTCCCCAAAAGAATACGATAACAATGCTACTAACGTTACGACTGC 

ACGTGATCTATCAATTTTAACCTATCATTTCCTTAAAAAATACCCTGATATA 

CTGAACTATACAAAATATCCTGAAGTCAAGGCCATGGTCGGAACTCCTTAT 



GAAGAAACATTTACAACTTATAACTACTCTACCCCCGGCGCTAAATTTGGA 
TTAGAAGGAGTAGATGGCTTAAAAACTGGTTCTAGCCCTAGCGCTGCTTTT 
AATGCCTTAGTTACAGCTAAACGCCAGAATACTCGCTTGATAACTGTGGTT 
TTAGGAGTTGGCGATTGGTCAGACCAAGACGGAGAGTACTATCGTCATCC 
5 GTTTGTCAACGCTCTTGTAGAAAAAGGTTTTAAAGACGCTAAAAATATTTC 
TTCTAAAACTCCTGTATTAAAAGCCGTTAAACCTAAAAAAGAAGTTACTAA 
AACCAAAACTAAATCTATTCAAGAACAGCCTCAAACAAAAGAACAGTGGT 
GGACAAAAACAGATCAATTTATCCAATCACATTTTGTATCTATTTTAATTG 
TTCTGGGCACCATCGCTAGCCTTTGTCTTTTAGCTGGGATAGTATTACTTAT 
10 AAAGCGCTCTAGATAA 

(SEQ ID NO: 66) 

MPKLIVSFLCILLSLTCVNSVQAEEHKDIMQITREAGYDVKDINKPKASrVIDN 
KGHILWEDNADLERDPASMSKMFTLYLLFEDLAKGKTNLNTTVTATETDQAI 

1 5 SKIYEISNNNfflAGVA YPIRELITMTAVPSSNVATIMIANHLSQNNPDAFIKRINE 
TAKKLGMTKTHFYNPSGAVASAFNGLYSPKEYDNNATNVTTARDLSILTYHF 
LKKYPDILNYTKYPEVKAMVGTPYEETFTTYNYSTPGAKFGLEGVDGLKTGS 
SPSAAFNALVTAKRQNTRLITVVLGVGDWSDQDGEYYRHPFVNALVEKGFK 
DAKNISSKTPVLKAVKPKKEVTKTKTKSIQEQPQTKEQWWTKTDQFIQSHFVS 

20 ILP/LGTIASLCLLAGIVLLIKRSR* 



Sequence description: 

25 A] Length: 1 188 bp - 396 aa (full length gene) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, possesses a potential signal 
peptide 

30 

ID-109 
Clone 45 

35 (SEQ ID NO: 67) 

ATGACTGAAAAATATTATAATTGGGCAACGCTTGGAACCGGCGTTATTGCC 
AACGAATTAGCCCAAGCACTGGAAGCACGTGGACAAAAATTATATTCTGT 
AGCTAATAGAACTTACGACAAAGGACTTGAATTTGCTAACAAATATGGTA 
TGCAAAAAGTTTATGATGAGATAGATCAAGTATTTGAAGAGGCTGAAGTGG 

40 ATATCATTTATATCTCTACTCCCCACAATACTCACATCTCATTTTTACGAAA 
GGCTTTAGCAAATGGTAAGCACGTTCTTTGCGAAAAATCTATTACTTTAAA 
TAGTACTGAGCTTAAAGAAGCCATAGATTTAGCCGAAACTAACCATGTTGT 
CTTAGCTGAAGCCATGACTATTTTTCATATGCCAATTTACCGCCAATTAAA 
AACATTAGTTGATAGTGGAAAATTAGGACCGTTAAAAATGATTCAAATGA 



ATTTCGGAAGTTATAAAGAATATGATATGACTAACCGTTTTTTCAGTCGTG 

ACCTAGCAGGCGGTGCTTTGCTGGACATTGGTGTTTATGCACTTTCTTGTAT 

TCGCTGGTTTATGTCAGAAGCACCTCACAACATTACCTCTCAAGTTACATT 

TGCACCAACAGGGGTTGATGAACAAGTTGGTATCCTACTAACCAACCCAG 

CAAATGAGATGGCGACTGTCAGCCTTAGTTTACATGCAAAACAACCTAAA 

CGAGCAACTATCGCTTACGATAAAGGCTACATTGAACTTTTTGAATATCCG 

CGAGGACAAAAGGCAGTTATTACTTATACTGAGGATGGGCATCAAGATAT 

TATCGAAGCTGGCAAAACTGAAAATGCTCTCCAATATGAGGTAGCTGATA 

TGGAAGAAGCCATTTCAGGAAAAACTAACCACATGTACTTAAACTATACC 

AAAGATGTTATGGATATCATGACACAGCTACGTCAAGAATGGGGATTTAC 

CTACCCAGAAGAAGAAAAATGA 

(SEQ ED NO: 68) 

MTEKYYNWATLGTGVIANELAQALEARGQKLYSVANRTYDKGLEFANKYGI 

QKVYDHEDQVFEDPEVDIIYISTPHNTHISFLRKALANGKHVLCEKSITLNSTEL 

KEABDLAETNHVVLAEAMTIFHMPIYRQLKTLVDSGKLGPLKMIQMNFGSYK 

EYDMTNRFFSRDLAGGALLDIGVYALSCERWFMSEAPHNITSQVTFAPTGVDE 

QVGILLTNPANEMATVSLSLHAKQPKRATIAYDKGYEELFEYPRGQKAVITYT 

EDGHQDIIEAGKTENALQYEVADMEEAISGKTNHMYLNYTKDVMDIMTQLR 

QEWGFTYPEEEK* 



Sequence description: 

A] Length: 984 bp - 328 aa (full length gene) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, possesses a potential signal 
peptide 

ID-110 

Clone 2-2 

(SEQ ID NO: 69) 

GTGTATTCTCCTGTTAAATCTTCTAAAGGAAAAGTGATATTGTTAAAAAGT 
GATTTTCTAAAGAGCTTCATAGAAAGGAGAGGAAATATTTGTTTT 

(SEQ ID NO: 70) 

MYSPVKSSKGKVELLKSDFLKSFffiRRGNICF 



Sequence description: 



A] Length: 96 bp - 32 aa (partial sequence) 

B] GTG start codon - no obvious Shine-Dalgarno 



sequence 

Possesses a potential signal peptide 



ID-Ill 

Clone 2-3 

(SEQ ID NO: 71) 

AAATACTGTATCATTGCAACCTCAAATGCAGGTTTTGGAAACGAAGCATTT 

ACAGGTGACAGCGATAAAGACTTGAAAATTATGGAACGAATTTCTCCATA 

TTTCCGTCCAGAATTTCTAAATCGTTTCAATGGTGTTATTGAATTCTCTCAC 

CTAAGCAAAGATGACTTAAGCGAAATTGTAGATTTGATGCTTGATGAAGTT 

AACCAAACAATTGGCAAAAAAGGAATTGACCTTGTGGTAGATGAAAATGT 

TAAATCACACTTAATTGAACTGGGTTATGACGAAGCAATGGGAGTACGTC 

CATTGCGCCGTGTCATCGAGCAAGAAATTCGAGATCGCATCACAGACTACT 

ATCTCGATCATACAGACGTTAAACACCTAAAAGCTAATTTGCAAGATGGCC 

AAATCGTCATTTCTGAAAGATAA 

(SEQ ID NO: 72) 

KYCIIATSNAGFGNEAFTGDSDKDLKIMERISPYFRPEFLNRFNGVIEFSHLSKD 
DLSEIVDLMLDEVNQTIGKKGIDLVVDENVKSHLIELGYDEAMGVRPLRRVIE 
QEIRDRITDYYLDHTDVKHLKANLQDGQIVISER* 



Sequence description: 

A] Length: 429 bp - 143 aa (partial sequence) 

B] N-terminus yet to be elucidated. This gene 
was not in frame with nuc 



ID-112 
Clone 2-5 

- (SEQ ID NO: 73) - 

ATGTCAATGAATTTTTCATTTTTACCACAATATTGGTCCTATTTTAATTATG 

GTGTGATGGTAACCATTATGATTTCAACATGTGTTGTTTTTTTTGGAACTAT 

TATAGGCGTGTTAATTGCTTTAGTAAAGCGTACTAATTTACATTTTCTCACA 

ATATTAGCTAATTTCTATGTATGGGTATTTCGTGGGACACCGATGGTAGTT 

CAAATTATGATTGCTTTCGCATGGATGCATTTTAACAATTTACCAACAATT 



AGCTTTGGTGTTTTAGATTTAGATTTTACACGACTTTTACCTGGTATCATTA 

TCATTTCCTTAAATAGTGGTGCCTATATTTCGGAAATTGTACGTGCAGGGA 

TTGAGGCTGTACCATCTGGACAAATAGAAGCAGCTTACTCGTTGGGGATTC 

GACCTAAAAATACACTTCGCTATGTTATCTTACCCCAAGCTTTTAAAAATA 

TTTTACCTGCTCTAGGGAATGAATTTATTACAATTATTAAAGATAGTGCTCT 

CCTTCAAACTATTGGTGTCATGGAATTATGGAACGGAGCACAATCAGTTGT 

AACGGCTACTTACTCACCAGTTGCACCGTTATTATTTGCAGCATTTTACTAT 

TTAATGTTGACAACGATTCTCTCAGCTTTGTTAAAACAAATGGAGAAATAT 

CTTGGGAAAGGGGTAAAAATAGATGGTTGA 

(SEQ ID NO: 74) 

MSMNFSFLPQYWSYFNYGVMVTIMISTCVVFFGTIIGVLIALVKRTNLHFLTIL 
ANFYVWVFRGTPMVVQIMIAFAWMHFNNLPTISFGVLDLDFTRLLPGmiSLNS 
GAYISEIVRAGIEAVPSGQIEAAYSLGIRPKNTLRYVILPQAFKNELPALGNEFrn 
IKDSALLQTIGVMELWNGAQSVVTATYSPVAPLLFAAFYYLMLTTILSALLKQ 
MEK YLGKG VKIDG * 



Sequence description: 

A] Length: 699 bp - 233 aa (full length gene) 

B] Shine-Dalgarno sequence preceded the 'ATG' 
start codon. Possesses a potential leader peptide 
sequence. 

ID-113 

Clone 2-7 

(SEQ ID NO: 75) 

ATGAAAGACCTATTACGAAATAGTCTAGAGCAAAGTGGAAATTTAAGTTT 

TCAAGATATGATTTTACATATTCTTGTAGCAGCTTTATTGAGTGTAGTTATT 

TATGTTTCCTATGCTTATACGCATAGTGGAACTGCCTATAGTAAAAAGTTT 

AATGTTTCATTAATGACATTGACGGTCTTGACTGCAACAGTAATGACCGTT 

ATTGGTAATAATGTAGCCTTGTCATTGGGTATGGTCGGTGCCTTGTCAGTT 

GTTCGTTTTAGGACAGCCATAAAAGATTCAAGAGATACAGTTTATATTTTT 

TGGACCATAGTTGTTGGTATCTGTTGTGGTGTCGGTGACTATGTGGTAGCT 

GCATTAGGAAGTAGCGTTATOITTATCTTATTATGGGTTATGGGACGTGTT 

AAAAACGAGAATCGTATGTTATTGATTGTGAAGTGCGATAGAACACTAGA 

AGTTGATTTAGAAGGAATTTTCTTCCAATATTTTGACGGAAAAGCTGTTCA 

GCGTGTTAAAAATTCAACAACTAATACTATTGAAATGATTTTCGAAATCTC 

TAGAAAAGATTACGATAAGCAACTCCATGTAGATAATCAGTTAACTGAAA 



AAGTGTACCAATTGGGAAATATTGATTATTTCAACATTGTTAGCCAAAGCG 
ACGAAATCAATGGGTAG 



(SEQ ID NO: 76) 

5 MKDLLRNSLEQSGNLSFQDMELHELVAALLSVVIYVSYAYTHSGTAYSKKFNV 
SLMTLTVLTATVMTVIGNNVALSLGMVGALSVVRFRTAIKDSRDTVYIFWTIV 
VGICCGVGDYVVAALGSSVIFILLWVMGRVKNENRMLLIVKCDRTLEVDLEGI 
FFQYFDGKAVQRVKNSTTNTIEMIFEISRKDYDKQLHVDNQLTEKVYQLGNID 
YFNIVSQSDEING* 

10 

Sequence description: 

A] Length: 678 bp - 226 aa (full-length gene) 
15 B] ATG start codon is preceded by a Shine- 

Dalgarno sequence-Possesses a potential leader 
peptide sequence 

20 ID- 114 

Clone 2-8 

(SEQ ID NO: 77) 

25 AAAAATTCATTTTAGATTCATTTTACGACTATATACTCAGAAGTACCAAAC 
CTAATCCAAGGTTTGAAAAAAGAAAGAAGGAAGTCAGTATGACAAACTAT 
AAAAACAACTTTAAAGATGAGGCTATACGTGTTGAAGAGACAACAAAAGA 
ATCATTTTACGATGTTGATATTGCCTTGTTTTCAGCTGGTGGATCTATTTCA 
GCAAAGTTCGCTCCTTATGCAGTAAAGTCTGGAGCAGTTGTAGTAGATAAC 

30 ACGTCATATTTTCGTCAGAATCCTGATGTTCCACTAGTTGTTCCTGAAGTAA 
ATGCTCATGCCATGATTGGTCATAATGGTATCATAGCTTGTCCCAATTGTTC 
TACTATTCAAATGATGATTGCTTTAGAGCCCATTCGTCAAAAATGGGGGAT 
AGAGCGTGTTATAGTTTCCACCTATCAAGCTGTTTCGGGTTCAGGTGCACG 
TGCTGTTGAAGAAACTAAGGAACAGTTGAGACAAGTTTT 

35 

(SEQ ID NO: 78) 

KF1LDSFYDYILRSTKPNPRFEKRKKEVSMTNYKNNFKDEAIRVEETTKESFYD 
VDIALFSAGGSISAKFAPYAVKSGAVVVDNTSYFRQNPDVPLVVPEVNAHAMI 
GHNGIIACPNGSTIQMMIALEPIRQKWGIERVIVSTYQAVSGSGARAVEETKEQ 
40 LRQV 



Sequence description 



A] Length: 499 bp - 165 aa (partial sequence) 

B] N-terminus has yet to be determined 



5 ID-115 

Clone 2-9 

(SEQ ID NO: 79) 

1 0 ATG ACAA ATG AATTG ATA ATGC AAGCTTTTG AGTGGT ATTTACCT AGTGAT 
GGGAATCACTGGAAGAAATTAGAGGAGTCTATATCAGACCTTAAAAAACT 
TGGAATTAGTAAAATCTGGTTACCACCAGCATTTAAGGGAACTAGCAGTG 
ATGATGTAGGATATGGTGTTTATGATCTCTTTGATTTAGGAGAATTTGACC 
AGAATGGAACAATTAGAACAAAATATGGTAGGAAAGAAGAGTATCTAAA 

1 5 GCTT ATTAAGTCGTT A AAGGC AAATGGC ATTA A ACCGTTTGC AG AT ATCGT 
TCTTAACCATAAAGCCAATGGTGATCATAAAGAAAAATTTCAAGTCATCA 
AAGTCAATCCTGAAAATCGTCAAGAAGCATTAAGTGAACCCTATGAGATT 
GAAGGATGGACGGGATTTGATTTCCCAGGTAGACAGGGTGAGTACAATGA 
TTTT 

20 

(SEQ ID NO: 80) 

MTNELIMQAFEWYLPSDGNHWKKLEESISDLKKLGISKIWLPPAFKGTSSDDV 
GYGVYDLFDLGEFDQNGTIRTKYGRKEEYLKLIKSLKANGIKPFADIVLNHKA 
NGDHKEKFQVIKVNPENRQEALSEPYEIEGWTGFDFPGRQGEYNDF 

25 

Sequence description: 

A] Length: 456 bp - 152 aa (partial sequence) 
30 B] ATG start codon is preceded by a Shine- 

Dalgarno sequence, no leader peptide sequence. 



35 ID-116 

Clone 2-10 

(SEQ ID NO: 81) — - - - - - 

40 ATGGAGGTTCTTATGAAGAAAGTGTTAGTAAGTAGTCTTTTGGTTTTAGGG 
ATTACGATAACGTTACAACCAGTAGTTGAGGCTAAGGGGCCAAAAGTAGC 
TTATACACAAGAGGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAG 
TCACTACTATTTCTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGC 
CGATTACTGTTAGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATA 



TTTTCAATATGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCAT 
AAACAAAAATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCAT 
TCCCAAAGAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAG 
ATAAAATTGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAG 
5 GGCGAGGTTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTTGTA 
CCATCTGAT 

(SEQ ID NO: 82) 

MEVLMKKVLVSSLLVLGITITLQPVVEAKGPKVAYTQEGMTALSDTNKDKVT 
1 0 TISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQYGKE YVTPGSFDFLHKQKFW 
DLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGSMYKEGEVDKTA 
KALAKDFKFVPSD 



15 Sequence description: 

A] Length: 516 bp - 172 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence, Possesses a leader peptide 

20 sequence. 



ID-117 
25 Clone 2-17 

(SEQ ID NO: 83) 

ATGCTTAAAAGATTATTTACTGAAGATGGGGAATTGACAAAGATTAGTCGT 
CGTTTCGTTTGGATGTTAGTGGTTATCTATTGTCTTATTATTGTCAGGATGT 

30 GTTTTGGGCCTCAAATTATGATTGAGGGGGTATCAACTCCGAATGTTCAGC 
GCTTCGGAAGAATTGTAGCTCTTTTAGTACCATTTAATTCTTTTCGTAGTTT 
AGATCAGCTAACTAGCTTTAAAGAGATTCTTTGGGTTATTGGTCAAAATGT 
AGTGAATATTTTACTGCTGTTTCCTCTCATTATAGGGTTACTATCCCTAAAG 
CCAAGTTTACGGAAATATAAAAGCGTTATATTACTTGCTTTCTTGATGTCTC 

35 TTTTCATAGAGTGTACTCAAGTTGTTTTAGATATTTTAATAGATGCTAATCG 
GGTTTTTGAAATCGACGATCTATGGACAAATACCTTAGGCGGTCCTTTCGC 
CCTATGGAGTTATCGAAACATAAAAGGTTGGCTTCTAACTATTAGAAAATG 
A 



40 (SEQ ID NO: 84) 

MLKRLFTEDGELTKISRRFVWMLVVIYCLirVRMCFGPQIMIEGVSTPNVQRFG 
RrVALLVPFNSFRSLDQLTSFKEILWVIGQNVVNILLLFPLIIGLLSLKPSLRKYK 
SVILLAFLMSLFIECTQVVLDILIDANRVFEIDDLWTNTLGGPFALWSYRNIKG 
WLLTIRK* 



Sequence description: 



5 A] Length: 516 bp - 172 aa (full-length gene) 

B] ATG start codon is preceded by an Shine- 
Dalgarno sequence. Possesses a potential leader 
peptide sequence. C-terminus need further 
confirmation. 

10 

ID-118 
Clone 3-3 

15 

(SEQ ID NO: 85) 

ATGAAAAAGCTTACTTTTATTTGGGATTTAGATGGGACATTAATAGATTCG 
TATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTTGGCTTA 
ATATTTGATAAAGAATTAATCCATGAATATATTTTACAGGAATCAGTGGGG 

20 CAATTATTGGTAAACCTTTCAGAGGAAGAGCAAATACCTCATGAAAAACT 
GAAAGCATATTTTACAAAAGAACAAGAAAGTCGAGATTCTAAAATACATT 
TAATGCCATATGCAAAAGAGATTTTAGAATGGACCAAAGAACAAGATATT 
CCCAATTTTATGTATACACATAAAGGAGCAAGTACGCATTCAGTGTTGGAA 
ACCTTGCAGATCTCTCATTATTTTGATGAAATTTTAACTGGTGTTTCGGGAT 

25 TCGAGCGAAAACCACATCCACAAGGGATTAATTATTTAGTTAAACGATATT 

CTTTAGATAAATCAATGACTTATTACATAGGAGATCGTCCACTAGATTTGG 
AGGTTGCTCAAAATGCTGGTATAAAATCCATAAACTTAAGGTTAGAGAATT 
CCAAAGAAAACTATAATATTTCAAGTCTCAAAGATATAATATCACTTGATT 
TCACTGGTTTGGATTAA 

30 

(SEQ ID NO: 86) 

MKKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGQLL 
VNLSEEEQIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTH 
KGASTHSVLETLQISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYI 
35 GDRPLDLEVAQNAGIKSINLRLENSKENYNISSLKDnSLDFTRLD* 



Sequence description: 

40 A] Length: 627 bp - 209 aa (Possible Full-length gene) 

B] ATG start codon is preceded by an possible 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 



ID-119 
Clone 3-7 

5 

(SEQ ID NO: 87) 

ATGGAAAAAGAAAAAAAATTAGGTCTTTTACCACTAACAATGCTTGTCATT 

GGCTCTCTTATCGGTGGCGGAATCTTTGATTTAATGCAAAATATGAGTTCC 

AGAGCCGGTTTGGTACCAATGCTTATTGCTTGGGTAATTACTGCTATCGGG 

1 0 ATGGG A ACTTTCGTTTT AAGTTTTC AA AATTT ATCTGA A A AAAGGCCGGAC 
CTAACAGCTGGAATCTTTAGTTACGCTAAAGAGGGGTTTGGAAACTTTATG 
GGATTTAACTCTGCATGGGGTTATTGGTTATCAGCTTGGCTTGGAAATGTT 
GCCTACGCTGCACTCTTATTCAGTTCACTCGGTTATTTCTTTAAATTCTTTG 
GTAATGGAAATAATATCATCTCAATTATTGGAGCAAGTATAGTTATTTGGG 

1 5 TTGTCC ATTTCTT AATTTT AAGAGGTGTT A AT AC AGCTGC ATTT ATT AATAC 
CGTAGTTACCTTTGCAAAATTAGTACCTGTTATTATTTTCTTAATTTCAGCG 
TTATTAGCTTTCAAATTTAACATTTTTAGTCTTGATATCTGGGGAAATGGAT 
TACATCAATCAATTTTCAACCAAGTCAATTCAACTATGAAAACCGCTGTTT 
GGGTATTTATTGGTATTGAGGGCGCCGTTGTCTTCTCAGGTCGTGCTAAAA 

20 AACACTCTGATATTGGTAAAGCAAGTATCCTAGCATTATTCACTATGATTT 
CACTTTATGTATTGATTTCTGTTTTATCACTTGGTATCATGTCACGTCCAGA 
ACTTGCAAACTTAAAAACACCAGCTATGGCTTACGTTCTAGAAAAAGCTGT 
TGGTCACTGGGGTGCTATCTTAGTTAACCTTGGTGTTATCATTTCAGTATTT 
GGCGCTATTCTTGCTTGGACTTTATTTGCAGCAGAATTACCATATCAAGCT 

25 GCTAAAGAAGGTGCTTTTCCTAAATTTTTTGCAAAAGAAAATAAAAACAA 
AGCTCCAATCAACTCACTCTTAGTCACTAATCTTTGTGTACAAGCATTCTTA 
ATCACGTTCTTATTCACACAAAGTGCTTATCGTTTTGGTTTCGCATTAGCAT 
CATCTGCTATCTTAATTCCTTATGCTTTTACAGCACTATATCAATTACAATT 
CACACTCCGTGAGGATAAGTCAACTCCAGGACATCAAAAGAATTTAATTA 

30 TCGGTATCCTCGCTACAATCTATGCTGTTTACCTTATCTACGCTGGTGGTTT 
TGATTACTTACTTTTGACAATGATTGCTTATACTCTAGGTATGATTCTCTAT 
ATTAAAATGAGAAAAGATGACAAGCTTGGCGTAATCATGGTCATAGCTGT 
TTCCAGTGTGAAATTGTTATCC 

35 (SEQ ID NO: 88) 

MEKEKKLGLLPLTMLVIGSLIGGGIFDLMQNMSSRAGLVPMLIAWVITAIGMG 
TFVLSFQNLSEKRPDLTAGIFSYAKEGFGNFMGFNSAWGYWLSAWLGNVAY 
AALLFSSLGYFFKFFGNGNNIISIIGASIVIWVVHFLILRGVNTAAFINTVVTFAK 
LVPVIIFLISALLAFKFNIFSLDIWGNGLHQSIFNQVNSTMKTAVWVFIGIEGAV 

40 VFSGRAKKHSDIGKASBLALFTMISLYVLISVLSLGIMSRPELANLKTPAMAYV 
LEKAVGHWGAILVNLGVIISVFGAILAWTLFAAELPYQAAKEGAFPKFFAKEN 
KNKAPINSLLVTNLCVQAFLITFLFTQSAYRFGFALASSAILIPYAFTALYQLQF 
TLREDKSTPGHQKNLIIGILATIYAVYLIYAGGFDYLLLTMIAYTLGMILYIKMR 
KDDKLGVIMVIAVSSVKLLS 



Sequence description: 

5 

A] Length: 1356 bp - 452 aa (partial sequence) 

B] ATG start codon is preceded by an possible 
Shine-Dalgarno sequence. Possesses a potential 
leader peptide sequence. 

10 

ID-120 
Clone 3-8 

15 

(SEQ ID NO: 89) 

ATGAAATTTGAAAAACGGCAGGTCTATTATGTTGTCATAACATTTGCTATT 

TGCTATGCTATACAGGCTTATTGGGGAGCTGTTTCTAATATTTTAACTACGC 

TTCATAAGGCAATATTTCCTTTTTTGATGGGAGCTGGAATTGCCTATATTAT 

20 TAATATTGTAATGTCAGTCTATGAGCGATTATATATAAAGCTTTTTAAAGG 
ATCTAGACTATTAATGGCAATCAAGCGTAGTGTTTCTATGATTTTATCCTAT 
GCAACTTTTATTGGTTTAATTGTCTGGCTATTTTCAATTGTCATTCCAGATT 
TGATTTCTAGTTTGAGTTCTTTATTGGTTATTGATACCGGAGCACTTGCTAA 
ATTGGTTAATAATCTCAATGAAAATAAACAAATTTCTGAGGCTTTAAATTA 

25 TATGGGAACAGATAAAGACTTAGTTTCTACTTTAAGTGGTTATAGCCAGCA 
GATTTTGAAGCAAGTTTTATCTGTTTTAACAAATTTACTAACCTCAGTTTCC 
TCTATTGCGGCAACACTTCTGAATGTTTTTGTTAGTTTTATTTTTTCAATTTA 
CGTTTTGGCAAACAAGGAGCAGTTGGGACGTCAATTTAATTTGTTAATTGA 
TACCTATTTAGGTTCAACAGGCAAAACATTCCATTACGTTCGTCATATCCTT 

30 CATCAACGTTTCCATGGTTTTTTTGTAAGCCAAACTTTAGAAGCTATGATTT 
TAGGAAGTTTGACGGTTATTGGTATGTTGATCTTCCAATTTCCTTATGCTTT 
AACAGTTGGGGTTTTAGTTGCTTTTACAGCTCTAATACCGGTTGTGGGAGC 
CTACATTGGTGTTACAATCGGTTTCATCTTAATTGCTACTGAATCGCTTACT 
GAAGCATTCTTGTTTGTTCTTTTCTTGATCCTTTTACAACAATTTGAGGGAA 

35 ATGTCATTTATCCGAAAGTTGTCGGTGGATCGATTGGACTGCCTTCTATGT 
GGGTTTTAATGGCTATTACTATCGGAGGTGCTTTATGGGGGATCTTAGGCA 
TGTTACTTGCTGTTCCTGTTGCAGCTACTATCTATCAGATTGTAAAAGATCA 
TATTATCAAGCGACAAACGCTTAGAAATCGTGCACGAACCTATCGTTAA 

40 (SEQ ID NO: 90) 

MKFEKRQVYYVVITFAICYAIQAYWGAVSNILTTLHKAIFPFLMGAGIAYIINI 
VMSVYERLYIKLFKGSRLLMAIKRSVSMILSYATFIGLIVWLFSIVIPDLISSLSS 
LLVIDTGALAKLVNNLNENKQISEALNYMGTDKDLVSTLSGYSQQILKQVLSV 
LTNLLTSVSSIAATLLNVFVSFIFSIYVLANKEQLGRQFNLLIDTYLGSTGKTFH 



YVRHILHQRFHGFFVSQTLEAMILGSLTVIGMLIFQFPYALTVGVLVAFTALIP 
VVGAYIGVTIGFILIATESLTEAFLFVLFLILLQQFEGNVIYPKVVGGSIGLPSM 
WVLMAITIGGALWGILGMLLAVPVAATIYQrVKDHIIKRQTLRNRARTYR* 

5 

Sequence description: 

A] Length: 1 134 bp - 378 aa (full-length gene) 

B] ATG start codon is preceded by an typical 
10 Shine-Dalgarno sequence. Possesses a potential 

leader peptide sequence. 



15 ID-121 

Identical to ID-68, as described in WO 00/06736 



20 

ID-122 
Clone 3-16 

25 (SEQ ID NO: 91) 

GTGATTACAATTAAAAAGGAATCTGTTATCAAACTATTGAAGTATGCTTTT 
GGCATTATAATGGGATTTATTATCTTAGCTATTGTAATAGGTGGGCTCCTA 
TTTGCATACTACGTTAGTCGTTCTCCGAAATTAACCGATCAAGCTTTAAAA 
TCCGTTAACTCTAGTTTGGTTTATGATGGTAATAATAAACTTATTGCCGATT 

30 TAGGCTCAGAAAAGCGTGAAAGTGTTAGTGCGGATAGCATTCCACTAAAT 
TTGGTTAACGCTATCACTTCTATAGAAGATAAACGTTTCTTTAAACATAGA 
GGTGTCGATATTTATCGTATTTTAGGTGCAGCTTGGCATAACCTTGTTAGTA 
GTAATACGCAAGGTGGTTCAACCCTTGATCAACAGTTGATTAAACTGGCTT 
ACTTTTCTACCAATAAATCTGACCAAACGTTAAAACGTAAATCACAGGAA 

3 5 GTTTGGCTTGCGCTTC A A ATGG AGCGT AA AT AC ACC A A AG A AG A A ATTCTT 
ACTTTCTATATTAATAAAGTTTATATGGGAAATGGGAATTATGGTATGAGA 
ACAACAGCTAAATCATACTTTGGTAAAGACCTAAAGGAATTATCTATTGCA 
CAACTTGCTTTGCTCGCTGGTATTCCTCAAGCACCTACACAATATGACCCTT 
ATAAAAACCCAGAATCTGCTCAAACAAGACGTAATACCGTTCTTCAGCAG 

40 ATGTATCAAGATAAAAACATTTCTAAAAAGGAATACGACCAAGCTGTTGC 
AACTCCAGTAACTGATGGCTTAAAAGAATTAAAGCAAAAATCTACTTATCC 
AAAATATATGGATAACTACTTAAAACAAGTTATTAGTGAAGTTAAACAAA 
AAACTGGTAAAGATATCTTTACTGCTGGGCTAAAAGTGTATACTAATATCA 
ACACTGATGCACAAAAACAACTATATGACATCTACAACAGTGATACTTAC 



ATCGCTTATCCAAACAATGAATTACAAATAGCATCTACCATCATGGATGCG 
ACTAATGGTAAAGTCATTGCACAATTAGGCGGGCGTCATCAGAATGAAAA 
TATTTCATTTGGGACAAATCAATCTGTCTTAACAGACCGCGATTGGGGTTC 
TACAATGAAACCTATCTCAGCTTATGCACCTGCTATTGATAGTGGTGTCTA 
5 TAATTCAACAGGTCAATCATTAAACGACTCAGTTTACTACTGGCCTGGTAC 
TTCTACTCAACTATATGACTGGGATCGTCAATATATGGGTTGGATGAGTAT 
GCAGACCGCTATTCAACAATCACGTAACGTCCCTGCTGTCAGAGCACTTGA 
AGCCGCTGGATTAGACGAAGCAAAATCTTTCCTTGAAAAATTAGGCATAT 
ACTATCCAGAAATG 

10 

(SEQ ID NO: 92) 

MITIKKESVIKLLKYAFGIIMGFIILAIVIGGLLFAYYVSRSPKLTDQALKSVNSS 

LVYDGNNKLIADLGSEKRESVSADSIPLNLVNAITSIEDKRFFKHRGVDIYRILG 

AAWHNLVSSNTQGGSTLDQQLIKLAYFSTNKSDQTLKRKSQEVWLALQMER 

1 5 KYTKEEILTFYINKV YMGNGNYGMRTTAKS YFGKDLKELSIAQLALLAGIPQA 
PTQYDPYKNPESAQTRRNTVLQQMYQDKNISKKEYDQAVATPVTDGLKELK 
QKSTYPKYMDNYLKQVISEVKQKTGKDIFTAGLKVYTNINTDAQKQLYDIYN 
SDTYIAYPNNELQIASTIMDATNGKVIAQLGGRHQNENISFGTNQSVLTDRDW 
GSTMKPISAYAPAIDSGVYNSTGQSLNDSVYYWPGTSTQLYDWDRQYMGWM 

20 SMQTAIQQSRNVPAVRALEAAGLDEAKSFLEKLGIYYPEM 



Sequence description: 

25 A] Length: 1386 bp - 462 aa (partial sequence) 

B] GTG start codon is preceded by an 
typical Shine-Dalgarno sequence. Possesses a 
potential leader peptide sequence. 

30 

ID- 123 
Clone 3-17 

35 (SEQ ID NO: 93) 

ATGGCTAATGTATATGATTTAGCAAATGAATTAGAACGTGCTGTTCGTGCT 
TTACCAGAATACCAAGCAGTTTTAACTGCAAAAGCAGCTATTGAAAATGA 
TGCGGATGCACAAGTGCTTTGGCAAGACTTTTTGGCTACCCAATCAAAAGT 
TCAAGAAATGATGCAATCTGGCGAAATGCCAAGTOAAGAAGAAOAAGATG 

40 AAATGTCTAAACTTGGGGAAAAAATTGAATCCAATGACCTTTTAAAAGTTT 
ATTTTGACCAACAACAACGGTTGTCTGTCTATATGTCTGATATCGAAAAAA 
TTGTCTTTGCACCCATGCAGGACTTGATGTAA 



(SEQ ID NO: 94) 

MANVYDLANELERAVRALPEYQAVLTAKAAffiNDADAQVLWQDFLATQSK 
VQEMM QSGQMPSQEEQDEM S KLGEKIESNDLLK V YFDQQQRLS V YMSDIEKI 
VFAPMQDLM* 



15 

ID- 124 
Clone 3-26 

20 (SEQ ID NO: 95) 

ATGGCAGAAATCACAGCTAAACTTGTAAAAGAATTGCGTGAAAAATCAGG 
TGCAGGCGTTATGGACGCTAAAAAAGCATTAGTAGAAACTGATGGTGACC 
TTGATAAAGCGATTGAATTACTTCGCGAAAAAGGTATGGCTAAAGCAGCT 
AAAAAAGCAGACCGTGTTGCTGCTGAAGGTTTAACAGGTGTTTATGTTGAT 

25 GGTAACGTTGCAGCAGTTATTGAAGTTAA 

(SEQ ID NO: 96) 

MAEITAKLVKELREKSGAGVMDAKKALVETDGDLDKAIELLREKGMAKAAK 
KADRVAAEGLTGVYVDGNVAAVIEV 



Sequence description: 



10 



A] Length: 336 bp - 1 12 aa (full length sequence) 

B] ATG start codon is preceded by an 
typical Shine-Dalgarno sequence. No obvious 
potential leader peptide sequence. 



30 



Sequence description: 



35 



A] Length: 230 bp - 76 aa (partial sequence) 

B] ATG start codon is preceded by an 
typical Shine-Dalgarno sequence. No obvious 
potential leader peptide sequence. 



40 



ID- 125 



Clone 3-33 



(SEQ ID NO: 97) 

ATGATAAAAAACCTGTTATTAACAGGTTTTTTATCATTTAATGACGGAAAA 
CTGGACACAAATTATTTTTCTTGTATAATTAAATATATTATTTCTTATCAGG 
AGGTTATGATGACATTAGAGAAACGATTTAA 

5 

(SEQ ID NO: 98) 

MKNLLLTGFLSFNDGKLDTNYFSCHKYIISYQEVMMTLEKRF 



10 Sequence description: 

A] Length: 134 bp - 44 aa (partial sequence) 

B] ATG start codon is preceded by an 
typical Shine-Dalgarno sequence. Possible 

1 5 potential leader peptide sequence. 



ID- 126 
20 Clone 3-41 

(SEQ ID NO: 99) 

ATGAAAAATAATAAAAATAATGGTTTTCTGAAAAATTCCTTTATTTACATA 
TTATTGATTATTGCGGTTATTACAACCTTTCAATACTATTTAA 

25 

(SEQ ID NO: 100) 

MKNNKNNGFLKNSFIYILLIIAVITTFQYYL 



30 Sequence description: 

A] Length: 94 bp - 31 aa (partial sequence) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 

35 leader peptide sequence. 



ID- 127 
40 Clone 3-42 

(SEQ ID NO: 101) 

ATGTTAGATATTATCTTATCCGGAATTTCGCAAGGATTACTTTGGTCAATTA 
TGGCAATTGGCGTGTTTATCACTTTTCGTATCTTAGACATAGCCGATCTCTC 



TGCAGAAGGGGCTTTCCCTATGGGGGCTGCAGTTTGCGCCTTATGTATCGT 
TAA 



(SEQ ID NO: 102) 

5 MLDIILSGISQGLLWSIMAIGVFITFRILDIADLSAEGAFPMGAAVCALCrV 



Sequence description: 

10 A] Length: 158 bp - 52 aa (partial sequence) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. 

15 

ID- 128 

Clone 3-43 

20 (SEQ ID NO: 103) 

ATGGAAATGCCTAAAAGAAATGAATTACTCAATAAAGAAATTAAAATGAG 
TATTGATAAACTTAGATATAAAGAACCAGAGAGTGAACATGACAAGCGAC 
CTACTTTTTATTTGGTAGTACTTATACTTGTTACTGTAGCAGTTATATTGTC 
GTTATTTAA 

25 

(SEQ ID NO: 104) 

MEMPKRNELLNKEIKMSIDKLRYKEPESEHDKRPTFYLVVLILVTVAVILSLF 



30 Sequence description: 

A] Length: 161 bp - 53 aa (full-length gene) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 

35 leader peptide sequence. 



ID- 129 
40 Clone 3-44 

(SEQ ID NO: 105) 

GTGGTAAGTAAATTGAGTTTAACAACGATTTTTGCATTGCTATTTTCATCA 
ATGCTAATTTACGCAACACCTCTTATCTTTACAAGTATTGGGGGAACCTTC 



TCTGAACGTGGTGGTATCGTCAACGTTGGTTTAGAAGGAATTATGGTAATT 
GGAGCTTTCTCAGGCGTTGTATTTAA 



(SEQ ID NO: 106) 

5 MVSKLSLTTEFALLFSSMLIYATPLIFTSIGGTFSERGGIVNVGLEGIMVIGAFSG 
VVF 



Sequence description: 

10 

A] Length: 179 bp - 59 aa (partial sequence) 

B] GTG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. 

15 

ID-130 
Clone 3-46/47 

20 

(SEQ ID NO: 107) 

ATGAGAATTATTGCAATAACTGAAAAGGTTATAAAAGAACTGTTTCGTGAT 
AAAAGAACACTTGCTATGATGTTTTTAGCACCTATTTTAATTATGTTTTTGA 
TGAATGTTATGTTTTCTGCGAATAGTAATACAAAAGTTAAGATTGGAACTA 

25 TTAACGTTAACACGAAGGTCGTTTCAAATTTAGATAATATTAAGCATATTC 
AAGTGAGATCATTTAAATTTAACTCATCTGCTAAAAAAGCACTCAAATCAA 
ATAAAATTGATGCTCTTATTTCGGAGGACAATAAATCTTATACTGTCTTCT 
ATGCGAATACAGATTCTTCAAAGACGACTTTAACAAGACAAGCTTTTAAA 
ACCGCTGTTAATACAATGAACAGTAAGGAACTGATTTCGCAAGTTAAAATT 

30 TTAGCTAATAAGAATGCGAAACTAGCACAATCCTTACAAACTCGCTCCAAA 
TATATCAAAGAAAAATATAATTACGGAAATAAAAATACAGGCTTTTTTGC 
AAAAATGATACCAATACTAATGGGATTTATGGTCTTCTTCTTGGTTTTT 

(SEQ ID NO: 108) 

35 MRIIAITEKVIKELFRDKRTLAMMFLAPILIMFLMNVMFSANSNTKVKIGTINV 
NTKVVSNLDNIKHIQVRSFKFNSSAKKALKSNKIDALISEDNKSYTVFYANTDS 
SKTTLTRQAFKTAVNTMNSKELISQVKILANKNPKLAQSLQTRSKYIKEKYNY 
GNKNTGFFAKMIPILMGFMVFFLVF 

40 

Sequence description: 

A] Length: 558 bp - 186 aa (partial sequence) 

B] ATG start codon is preceded by a 



possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. C-terminus has yet to be 
determined. 



5 

ID-131 

Clone 3-48 

10 (SEQ ID NO: 109) 

GTGATTATCGTTATGAGTAAACATCAAGAAATTTTGGAGTACCTAGAAAAT 
TTAGCTGTTGGTAAGAGGGTTAGTGTACGCAGTATTTCAAATCATTTAA 

(SEQ ID NO: 1 10) 
15 MirVMSKHQEILEYLENLAVGKRVSVRSISNHL 



Sequence description: 

20 A] Length: 100 bp - 33 aa (partial sequence) 

B] GTG start codon is not preceded by a 
obvious Shine-Dalgarno sequence. No obvious 
leader peptide sequence. 

25 

ID-132 
Clone 2-c53 

30 (SEQ ID NO: 111) 

ATGTATAGAGAAATTACCGCTGTCGAACACGATCGCTTTGTGAGCGAATCC 
AACCAAACAAACCTACTTCAATCTCTTAATTGGCCCAAAGTAAAAGACAA 
CTGGGGTAGTCAATTACTTGGCTTTTTTGACGGTGAAACCCAAATTGCCAG 
CGCTAGTATTCTCATCAAATCACTTCCTCTTGGCTTCTCCATGCTGTATATT 

35 CCGCGTGGACCAATCATGGATTACTCCAATCTAGATATTGTAACTAAGGTC 
CTTAAGGACCTTAAAGCTTTTGGCAAAAAACAAAGAGCTCTCTTTATCAAG 
TGTGATCCTCTCATCTATTT 

(SEQ ID NO: 112) 

40 MYREITAVEHDRFVSESNQTNLLQSLNWPKVKDNWGSQLLGFFDGETQIASA 
SILIKSLPLGFSMLYIPRGPIMDYSNLDIVTKVLKDLKAFGKKQRALFIKCDPLI 
Y 



Sequence description: 



5 A] Length: 326 bp - 108 aa (partial sequence) 

B] ATG start codon is preceded by an obvious 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 

10 

ID-133 
Clone 2-c59 

15 (SEQ ID NO: 113) 

ATGGACAAGAAAAAAATCTTAGTAACGGGTATTGTGCCTAAAGAAGGTCT 
AAGAAAGCTTATGGACCGATTTGATGTTACTTATTCAGAAGATCGCCCATT 
TTCACGTGACTATGTGTTAGAGCATTTATCTGAATATGACGGATGGTTACT 
CATGGGACAAAAAGGTGATAAAGAGATGATTGATGCAGGTGAAAACTTAC 

20 AAATTATTTCTTT 

(SEQ ID NO: 1 14) 

MDKKKLLVTGIVPKEGLRKLMDRFDVTYSEDRPFSRDYVLEHLSEYDGWLLM 
GQKGDKEMIDAGENLQIIS 



Sequence description: 



30 A] Length: 215 bp - 71 aa (partial sequence) 

B] ATG start codon is preceded by an obvious 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 

35 

ID-134 

Clone 2-c62 

40 (SEQ ID NO: 115) 

ATTTCGAAAGATGACTACCAAAATATTAGTTTTGGACAGGATCCAGAAGTT 
GTTGATTATGCTGGTCTGTTTGAAAAACGCCGTCCAGTTTTAGAAAAAGCA 
GTTAAAAATTTCTTGCAAGAAGAGAGAGCTACGAGAATGCTATCTGATTTC 
TTGCAAGAAGAAAAATGGGTAACTGATTTTGCTGAATTTATGGCGATCAA 



AGAACATTTTGGTAATAAGGCGCTTCAAGAATGGGATGACAAGGCTATTA 
TACGCCGCGAAGAAGAAGCCTTAGCAGGATATCGTCAAAAGCTTAGTGAA 
GTGATAAAATATCATGAAGTAACGCAATATTTCTTTTACAAACAATGGTTT 
GAGTTAAAAGAATATGCTAATGATAAAGGGATTCAAATTATCGGTGATAT 
5 GCCAATCTACGTTTCTGCCGATAGTGTAGAAGTTTGGACAATGCCTGAACT 
GTTT 

(SEQIDNO: 116) 

ISKDDYQNISFGQDPEVVDYAGLFEKRRPVLEKAVKNFLQEERATRMLSDFLQ 
10 EEKWVTDFAEFMAIKEHFGNKALQEWDDKAIIRREEEALAGYRQKLSEVIKY 
HEVTQYFFYKQWFELKEYANDKGIQIIGDMPIYVSADSVEVWTMPELF 

A] Length: 459 bp - 153 aa (partial sequence) 
15 B] More sequencing is required to determine the 

N- and C-termini 

enzyme). - Streptococcus pneumoniae (63%) 

20 ID- 135 

Identical to ID- 108 described in WO 00/06736 
Clone 2-c63 

25 

ID-136 
Clone 2-c66 

30 

(SEQIDNO: 117) 

ATGGCAAAACAGAAAAATAACTGGCGCCGTGTTGGAGTTGGTGTCCTTAC 
ACTTGCTTCAGTTGCGACTCTTGCTGCATGTGGAAGTAAATCAGCTTCCCA 
GGATTCTAATGGAGCGATTAATTGGGCTATTCCAACAGAAATCAATACACT 

35 AGATTTATCTAAAGTTACAGACACTTACTCAAATCTAGCTATTGGTAACTC 
TAGTAGTAATTTCCTTCGCTTAGATAAAGATGGAAAGACAAGACCAGACTT 
GGCTACTAAAGTTGATGTTTCAAAAGATGGCTTAACTTATACAGCTACATT 
ACGTAAAGGCTTGAAGTGGTCAGATGGCAGTAAACTTACTGCAAAGGATT 
TTGTTTATTCATGGCAACGTTTAGTTGATCCTAAAACAGCTTCACAATATG 

40 CTTACCTTGCTGTTGAAGGGCATGTGCTTAATGCCGATAAAATCAACGAAG 
GACAAGAGAAAGACTTGAATAAGCTAGGTGTTAAGGCAGAAGGCGATGA 
CAAAGTTGTTATTACTTTATCTAGTCCGTCTCCGCAATTCATCTACTACCTT 
GCATTCACTAACTTCATGCCACAAAAACAAGAAGTTGTTGAAAAATATGG 
AAAAGATTACGCAACTACTTCAAAAAATACAGTTTACTCAGGACCATATA 



CTGTTGAAGGTTGGAATGGTTCGAATGGTACTTTCACGCTGAAGAAAAAC 
AAAAATTATTGGGACGCTAAAAATGTAAAAACAAAAGAAGTTCGCATCCA 
GACTGTTAAAAAACCAGATACCGCCGTTCAAATGTATAAACGTGGTGAGT 
TAGATGCAGCTAATATCTCAAATACTTCTGCTATTTATCAAGCTAATAAAA 
5 ATAATAAAGATGTCACAGATGTTCTAGAAGCGACCACTGCCTATATGGAA 
TATAATACTACTGGTTCTGTGAAAGGGCTTGATAATGTTAAGATTCGTCGC 
GCCTTAAACTTAGCAACTAACCGTAAAGGAGTTGTTCAAGCAGCCGTTGAT 
ACAGGCTCAAAACCGGCAATTGCTTTTGCACCTACTGGTTTAGCCAAAACA 
CCAGATGGAACTGATTTGGCAAAATATGTTGCCCCAGGTTATGAATATAAT 
1 0 AAAACTGAAGCAGC AAAACTCTTTAGACTA 



(SEQIDNO: 118) 

MAKQKNNWRRVGVGVLTLASVATLAACGSKSASQDSNGAINWAIPTEINTLD 
LSKVTDTYSNLAIGNSSSNFLRLDKDGKTRPDLATKVDVSKDGLTYTATLRKG 

1 5 LKWSDGSKLTAKDFV YS WQRLVDPKTASQ YA YLA VEGHVLNADKINEGQEK 
DLNKLGVKAEGDDKVVITLSSPSPQFIYYLAFTNFMPQKQEVVEKYGKDYAT 
TSKNTVYSGPYTVEGWNGSNGTFTLKKNKNYWDAKNVKTKEVRIQTVKKPD 
TAVQMYKRGELDAANISNTSAIYQANKNNKDVTDVLEATTAYMEYNTTGSV 
KGLDNVKIRRALNLATNRKGVVQAAVDTGSKPAIAFAPTGLAKTPDGTDLAK 

20 YVAPGYEYNKTEAAKLFRL 



Sequence description: 

25 

A] Length: 1 143 bp - 381 aa (partial sequence) 

B] Shine -Dalgarno sequence precedes ATG codon. 
Possesses a potential leader peptide sequence. 

30 

ID- 137 
Clone 2-c67 



35 (SEQIDNO: 119) 

TTGAGAGTTTATGAAAATAAAGAAGAGTTGAAAAAAGAAATAAGTAAAAC 
ATTTGAGAAATACATTATGGAATTTAATAA 

TATTCCAGAGAATCTAAAAGATAAAAGAATTGATGAAGTTGATAGAACTC 
C AGGAGA AA ACGTTTCTTATC AGGTTGGCT 
40 GGACCAACTTGGTTCTTAAATGGGAAGAAGATGAAAGAAAGGGACTTCAA 
GTAAAAACACCATCGGATAAATTT 



(SEQ ID NO: 120) 

MRVYENKEELKKEISKTFEKYIMEFNNIPENLKDKRIDEVDRTPAENLSYQVG 
WTNLVLKWEEDERKGLQVKTPSDKF 

5 

Sequence description 

A] Length: 234 bp - 78 aa (partial sequence) 

B] TTG start codon is preceded by a 

10 potential Shine-Dalgarno sequence. No obvious 

leader peptide sequence. 



15 ID-138 

Clone 2-c70 
(SEQ ID NO: 121) 

20 ATGTCAAAGTTTGATAGTCAGAAAATAATTACTCCGATTATGAAGTTTGTC. 
AATATGCGAGGGATTATTGCACTCAAAGATGGCATGCTAGCAATTTTACCA 
CTAACAGTTGTTGGGAGTCTCTTTTTAATATTAGGGCAGCTTCCATTT 

(SEQ ID NO: 122) 

25 MSKFDSQKIITPIMKPVNMRGIIALKDGMLAILPLTVVGSLFLILGQLPF 



Sequence description 

30 A] Length: 150 bp - 50 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
Shine-Dalgarno sequence. Possesses a potential 
leader peptide sequence. 

35 

ID- 139 

Clone 2-c7 1 

40 (SEQ ID NO: 123) 

GAGACCACTTCATCAGTTAAACCAGCAGGAATTGACCGTATCAATCATACC 
TCAACACCCCCGAAGAAAACTACCCCCAACATTGCAACGACGCATAGCTT 
CAAAGATCGTTGTGATACTTTAGAAAGAATTCACAATGAAGACATTGATGT 
TTGTTCTGGATTCATTTGTGGTATGGGAGAGAGCGATGAGGGGCTCATCAC 



ATTAGCTTTCAGACTAAAAGAACTGAACCCCTATTCTATCCCTGTCAATTTT 
TTACTTGCTGTTGAAGGAACACCTCTTGGAAAATATAACTATTTGACTCCC 
ATTAAATGCTTAAAAATTATGGCCATGTTGCGTTTTGTTTTTCCTTTCAAGG 
AATTAAGATTAAGTGCTGGACGGGAGGTCCATTTTGAGAATTTTGAATCAT 
5 TAGTCACCTTACTTGTTGACTCAACTTTTTTGGGAAATTACCTAACAGAGG 
GGGGTCGCAATCAACATACCGATATTGAATTCTTGGAAAAATTACAACTA 
AATCATACTAAAAAGGAATTAATTT 

(SEQIDNO: 124) 

10 ETTSSVKPAGIDRINHTSTPPKKTTPNIATTHSFKDRCDTLERIHNEDIDVCSGFI 
CGMGESDEGLITLAFRLKELNPYSIPVNFLLAVEGTPLGKYNYLTPIKCLKIMA 
MLRFVFPFKELRLSAGREVHFENFESLVTLLVDSTFLGNYLTEGGRNQHTDIEF 
LEKLQLNHTKKELI 

15 

Sequence description: 



A] Length: 535 bp - 178 aa (partial sequence) 
20 B] N- and C-termini require verification 



ID- 140 
25 Clone 2-c73 

(SEQIDNO: 125) 

ATGCCGGTTTGGACTGCACAGTCTATTCCAAAGGCATTTTTAGAAAAGCAT 
AATACTAAGGAAGGCACCTGGGCAAAACTAACCATTCTAAGTGGTTCTTTA 

30 GTATTTTACCAGTTATCTCCTGATGGAGAGGAAATCTCGCGGCATATTTTT 
GATGCTAGTAGTGATATTCCTTTTGTTGATCCACAAGTCTGGCATAAAGTT 
TCGGCGAATAGTCCAGACTTAAGTTGCTATCTAACTTTTTACTGCCAAAAA 
GAAGATTACTTCCATAAAAAATATGGTCTCACGCGCACACATTCTGAGGTT 
ATCGCCAGTGCACCTCTCTTATCTGAGAAGAGTAATATATTAGACCTTGGG 

35 TGTGGTCAAGGGCGAAACTCACTTTATTTATCGCTGCTGGGACATCAAGTG 
ACTTCTGTCGATTCAAACGGACAGAGCCTTGTAGCTTTAGAAAATATGGCA 
TTAGAAGAAGAGCTTCCTTACAATATAAAAAGGTATGATATTAATACTACT 
GCTATTGAAGGGCACTATGATTTTATTTTATCAACTGTGGTATTTATGTTTT 
T 

40 

(SEQIDNO: 126) 

MPVWTAQSIPKAFLEKHNTKEGTWAKLTILSGSLVFYQLSPDGEEISRHIFDAS 
SDIPFVDPQVWHKVSPNSPDLSCYLTFYCQKEDYFHKKYGLTRTHSEVIASAP 



LLSEKSNILDLGCGQGRNSLYLSLLGHQVTSVDSNGQSLVALENMALEEELPY 
NIKRYDINTTAIEGHYDFILSTVVFMF 



5 Sequence description: 

A] Length: 563 bp - 187 aa (partial sequence) 

B] N- and C-termini require verification 

10 

ID-141 
Clone 2c76 

15 

(SEQ ID NO: 127) 

ATGACAAAGCAAATAATTGCCATTTGGGCTGAAGATGAAGACCATTTGAT 

TGGAGTTAATGGCGGTTTACCATGGAGGCTTCCTAAAGAGTTACATCACTT 

CAAAGAAACGACCATGGGGCAGGCTTTGCTTATGGGACGAAAGACCTTTG 

20 ATGGAATGAACCGTCGTGTTTTACCTGGTAGAGAGACAATCATCTTAACAA 
AAGATGAACAATTCCAAGCAGATGGAGTGACAGTCCTAAATAGTGTTGAA 
CAAGTTATAAAATGGTTTCAGGAACATAATAAGACCTTATTTATTGTAGGT 
GGTGCAAGTATTTATAAAGCATTTCTGCCTTATTGTGAAGCAATCATAAAA 
ACTAAAGTTCATGGAAAATTCAAAGGTGATACCTATTTTCCTGATGTTAAT 

25 CTATCTGAGTTT 

(SEQ ID NO: 128) 

MTKQIIAIWAEDEDHLIGVNGGLPWRLPKELHHFKETTMGQALLMGRKTFDG 
MNRRVLPGRETIILTKDEQFQADGVTVLNSVEQVIKWFQEHNKTLFIVGGASI 
30 YKAFLPYCEAIIKTKVHGKFKGDTYFPDVNLSEF 



Sequence description: 

35 

A] Length: 417 bp - 139 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence. No leader peptide sequence 

40 

ID-142 



Clone 2-c78 



(SEQ ID NO: 129) 

TTGTGGCCAAACTGTGCCCCGCTTATTAATAGCACTTTGTTCACCATTGAA 
GATATCTTAACATCAGGTGCTCATAGCAACCCTATTTTAATGGGGGTTATA 
CTTGGCGGGACAATTGTAGTAGTGGCGACAGCACCACTTTCTTCTATGGCA 
5 TTGACAGCTATGCTAGGATTAACCGGAATGCCTATGGCTATAGGAGCCTTG 
TCTGTCTTTGGTTCGTCATTTATGAATGGTGTACTTTTCCATAAATTAAAAC 
TTGGAAGTCGTAAAGATAATATAGCTTTTGCTGTTGAGCCTCTAACTCAAG 
CTGACGTGACTTCAGCTAACCCTATTCCAATCTATGTCACTAATTTTGTTGG 
TGGTGCAGCTTGTGGTATTTTAATTGCCTTGATGAAATTAGTTAATGATACT 
1 0 CCTGG A AC AGCG AC ACC A ATTGC AGG ATTTGCTGTC ATGTTTGCCT AT A AC 
CCAATGATAAAAGTACTAATAACCGCTCTAGGTTGTATTATCCTATCTTTA 
CTAGCAGGCTATTTTGGAGGCATTGTTTTT 

(SEQ ID NO: 130) 

1 5 MWPNCAPLINSTLFTffiDILTSGAHSNPILMGVILGGTIV VVATAPLSSMALTA 
MLGLTGMPMAIGALSVFGSSFMNGVLFHKLKLGSRKDNIAFAVEPLTQADVT 
SANPIPIYVTNFVGGAACGILIALMKLVNDTPGTATPIAGFAVMFAYNPMIKVL 
ITALGCIILSLLAGYFGGIVF 

20 

Sequence description: 



A] Length: 540 bp - 180 aa (partial sequence) 
25 B] N- and C-termini have yet to be elucidated 



ID-143 
30 Clone 2-c80 

(SEQ ID NO: 131) 

ATGTTTTTAAGTATAATGGCAGGTGTCATAGCATTTGTCCTGACAGTTATT 
GCCATTCCACGCTTCATTAAGTTTTACCAATTGAAGAAAATTGGCGGGCAA 
35 CAAATGCATGAAGATGTCAAACAACATCTAGCCAAAGCAGGTACGCCGAC 
AATGGGAGGAACGGTATTTT 

(SEQ ID NO: 132) 

MFLSIMAGVIAFVLTVIAIPRFIKFYQLKKIGGQQMHEDVKQHLAKAGTPTMG 

40 GTVF 



Sequence description: 



A] Length: 172 bp - 57 aa (partial sequence) 

B] Shine Dalgarno sequence precedes 'ATG' start 
codon. Possesses a potential leader peptide 
sequence. 

ID- 144 
Clone 3-83 
(SEQIDNO: 133) 

ATGAAACCATATTTATCTTTTATTGGTAGAACGTTATTATACTTCGGTATTT 
TATTGTTACTAATTTACTTTTTTGCATACCTTGGTCGCGGACAAGGCAGTTT 
TATTTATAA 

(SEQIDNO: 134) 

MKPYLSFIGRTLLYFGILLLLIYFFAYLGRGQGSFIY 



Sequence description: 

A] Length: 1 13 bp - 37 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. Possesses a 
potential leader peptide sequence. 

This orf is not in frame with nuc 



ID- 145 

Clone 3-86 
(SEQIDNO: 135) 

ATGTCATATTTTAGAAATTACTGGTATCGTTTTGGAGCAATTTTATTTATTA 

TTTTAGCAGTAATATTGCTTGTTTTTAGACCTGACTGGTCAATGCTTCACTA 

TCTATTGTATTTTTACTTTATGGCACTTCTAGCGCATCAATTTGAAGAATAT 

CAGTTTCCCGGTGGGGCATCACCTATCATTAACTATGTTGTTTATGATGAA 

GAAGAGCTGATGGATTGTTTTCCAGGCAATACTCAGTCTATTATGTTGGTT 

AATACTATTGCTTGGTTGCTTTACATTGCTAGTATTGCTTTTCCTCAAGCTT 

ATTGGCTTGGATTAGGAGTCATGTTCTTTAGTCTAACGCAGCTCTTGGGTC 

ATGGTTTTCAGATGAATATTAAACTTAAAACTTGGTATAATCCTGGTCTAG 

CAACGACAGTATTTCTCCTAGTACCAATAGCTTGCGCATACATCTATCAAG 



CTAGTGCAGAAGGAATGCTCACTTGGGGAGATTGGCTAGGTGGTTTTATCA 
TGTTGATTGTCTGTGTACTAACTAGCATTATTGCACCTGTACAGCTATTGAA 
GGATAAGGAGACCAATTATATTATTAGTCCTTGGCAAATGGACCGTTTTCA 
TAAGGTCGTTAATTTTGTAAGGATAAAAAAATAA 

(SEQ ID NO: 136) 

MSYFRNYWYRFGAILFIILAVILLVFRPDWSMLHYLLYFYFMALLAHQFEEYQ 

FPGGASPIINYVVYDEEELMDCFPGNTQSIMLVNTIAWLLYIASIAFPQAYWLG 

LGVMFFSLTQLLGHGFQMNIKLKTWYNPGLATTVFLLVPIACAYIYQASAEG 

MLTWGDWLGGFIMLIVCVLTSIIAPVQLLKDKETNYIISPWQMDRFHKVVNFV 

RIKK* 



Sequence description: 

A] Length: 651 bp - 219 aa (full length gene) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. Possesses a 
potential leader peptide sequence. 

ID-146 
Clone 3-c88 
(SEQ ID NO: 1 37) 

ATGCCACTTACAGCACTTGAAATTAAAGATAAAACATTTTCATCAAAATTT 
CGCGGTTATAGCGAAGAAGAAGTT 

(SEQ ID NO: 138) 

MPLTALEIKDKTFSSKFRGYSEEEV 



Sequence description: 

A] Length: 75 bp - 25 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No leader 
peptide 



ID- 147 



Clone 3-90 



(SEQIDNO: 139) 

5 ATGTCACTTTTTCAAGAAAAAATTGCTTACAATTGCGCTAAAAAGGAAGCG 
CTTTATAAAGAGAGTTTAGGACGCTACGCCTTGAGATCAATGCTAGCAGG 
GGCTTATTTGACAATGAGTACTGCTGCCGGTATCGTCGCAGCTGATACTAT 
TGGTAAAATTTCTCCTGCTCTATCAGGTTTTGTATTTGCTTTCATCTTTAGTT 
TTGGACTTATTTATGTTTTAATATTTAATGGTGAATTGGCGACATCTAATAT 
1 0 GCTTTATCTCACTGC AGG AGCCTAT A AT A A A AAT ATCTCTTGG A AA A AAGC 
CATAACAATTTTAATTTATTGTACTTTTTTCAACCTCGTTGGTGCTTGTATA 
TTAGCTTGGTTGTTTAA 

(SEQ ID NO: 140) 

1 5 MSLFQEKIAYNC AKKEALYKESLGR YALRSMLAGAYLTMSTAAGIV AADTIG 
KISPALSGFVFAFIFSFGLIYVLIFNGELATSNMLYLTAGAYNKNISWKKAITILI 
YCTFFNLVGACILAWLF 



20 Sequence description 

A] Length: 406 bp - 125 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
25 typical Shine-Dalgarno sequence. Possible 

leader peptide 

ID- 148 

30 

Clone 3-92 

(SEQ ID NO: 141) 

AAGTTACAAGCGACTGAAGTTAAGAGCGTTCCGGTAGCACAACCAGCTTC 
35 AACAACAAATGCAGTAGCTGCACATCCTGAAAATGCAGGGCTCCAACCTC 
ATGTTGCAGCTTATAAAGAAAAAGTAGCGTCAACTTATGGAGTTAATGAA 
TTCAGTACATACCGTGCGGGAGATCCAGGTGATCATGGTAAAGGTTTAGC 
AGTTGACTTTATTGTAGGTAAAAACCAAGCACTTGGTAATGAAGTTGCACA 
GTACTCTACACAAA ATATGGCAGCAAATAAC ATTTG ATATGTTATCTGGGA 
40 ACAAAAGTTTTATTCAAATACAAATAGTATTTATGGACCTGCTAATACTTG 
GAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAACCACTATGACCACGT 
TCACGTATCATTTAA 



(SEQ ID NO: 142) 

KLQATEVKSVPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVNEF 
STYRAGDPGDHGKGLAVDFrVGKNQALGNEVAQYSTQNMAANNISYVIWQQ 
KFYSNTNSIYGPANTWNAMPDRGGVTANHYDHVHVSF 



Sequence description 



A] Length: 419 bp - 139 aa (partial sequence) 

B] N- and C-termini have yet to be determined 



ID- 149 

Clone 3-94 

(SEQ ID NO: 143) 

ATGATTCCAGTAGTTATTGAACAAACAAGTCGTGGTGAACGTTCTTATGAT 

ATTTACTCACGTCTTTTAAAAGATCGTATTATTATGTTGACAGGCCAAGTT 

GAGGATAATATGGCCAATAGTATCATTGCACAGTTATTGTTTCTCGATGCA 

CAAGATAATACAAAGGATATTTACCTTTATGTCAATACACCAGGTGGTTCA 

GTATCGGCTGGACTTGCTATTGTGGACACCATGAACTTCATTAAATCGGAC 

GTACAGACGATTGTTATGGGGATGGCTGCTTCGATGGGAACCATTATTGCT 

TCAAGTGGTGCTAAAGGAAAACGTTTTATGTTACCGAATGCAGAATATATG 

ATCCACCAACCAATGGGCGGAACAGGCGGAGGTACACAGCAATCTGATAT 

GGCTATCGCTGCTGAGCATCTTTTAAAAACGCGTCATACTTTAGAAAAAAT 

CTTAGCTGATAATTCTGGTCAATCTATTGAAAAAGTCCATGATGATGCAGA 

GCGTGATCGTTGGATGAGTGCTCAAGAACACTTGATTATGGCTTTATTGAT 

GCTATTATGGAAAATAATAATTTACAATAATAGATTTAAAAGAGTTGAGTT 

TACCAACTCTTTTTTTATTTGTTGGAATTATGTTATAATCTTAGTAATTACA 

GATATGACGCAGAAAGGAAAAAATTATTGA 

(SEQ ID NO: 144) 

MIPVVEEQTSRGERSYDIYSRLLKDRIIMLTGQVEDNMANSIIAQLLFLDAQDN 
TKDIYLYVNTPGGSVSAGLAIVDTMNFIKSDVQTIVMGMAASMGTIIASSGAK 
GKRFMLPNAEYMIHQPMGGTGGGTQQSDMAIAAEHLLKTRHTLEKILADNSG 
QSIEKVHDDAERDRWMSAQEHLIMALLMLLWKinYNNRFKRVEFTNSFFICW 
NYVI1LV1TDMTQKGKNY* - 



Sequence description 



A] Length: 693 bp - 231 aa (full length gene) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No leader 
peptide. Significantly, it would appear to have a 

5 very hydrophobic C-terminus. 



ID- 150 



10 Clone 2-c86 

(SEQ ID NO: 145) 

ATGAAACCAAAAaTTATTGGTGTACTTGGTCTAGGAATATTTGGACAAACA 
CTCGCACAAGAACTAAGTAACTTTGAACAAGATGTTATTGCTATTGACAGC 

15 AATCCTGAAAATGTACAAGCTGTCGCCGAAGT 

TGTTACAAAAGCAGCTATCGGAGACATTACTGATTTAGCTTTCCTAAAACA 
CATCGGGATCAGTGACTGTGATACTGTTATTATTGCTACAGGAAACAGTTT 
AGAGAGCTCAGTATTGGCCGTAATGCACTGTAAAAAGTTAGGCGTCCCAC 
AAGTTATTGCTAAAGCTCGAAACCTTGTATACGAAGAAGTACTTTATGAAA 

20 TTGGTGCTGATTTGGTTATCTCTCCGGAGCGAGAATCTGGGCAAAATGTTG 
CTGCAAACCTCATGAGAAATAAAATTACAGATGTCTTCCAGATTGAATCTG 
ATATTTCTGTCATTGAATTT 

(SEQ ID NO: 146) 

25 MKPKIIG VLGLGIFGQTLAQELSNFEQD VIAIDSNPEN VQA V AEV VTKAAIGDI 
TDLAFLKHIGISDCDTVIIATGNSLE 

SSVLAVMHCKKLGVPQVIAKARNLVYEEVLYEIGADLVISPERESGQNVAAN 
LMRNKITDVFQIESDISVIEF 

30 

Sequence description: 



A] Length: 459 bp - 153 aa (partial sequence) 
35 B] Putative ATG start codon is preceded by a 

typical Shine-Dalgarno sequence. Possesses a 
potential leader peptide sequence. 
This orf is not in frame with nuc 

40 

ID-151 



Clone 2-c88 



(SEQ ID NO: 147) 

GTGCGTTATAGTAAAGAGATTATTCAGTTAGCTATACCAGCTATGATTGAA 

AATATCTTACAAATGCTCATGGGAGTAGTTGATAATTATCTAGTGGCTCAG 

TTAGGTGTTGTAGCAGTATCAGGTGTTTCAGTTGCTAATAATATAATTACT 

ATTTATCAAGCTATTTTTATAGCTTTAGGGGCGAGTATAGCAAGTCTATTG 

GCCAAGTCGTTAGCAGGTAGTGAGAAGGATGATGCAATTTCAGTATGTTCT 

CAAGCCATTTTTCTAACATCACTGATAGGGGCAGTATTAGGAATTATCTCG 

ATTGTTTTT GG AC AA ACTTTCTTT 

(SEQ ID NO: 148) 

MRYSKEnQLAIPAMIENILQMLMGVVDNYLVAQLGVVAVSGVSVANNUTIY 
QAIFIALGASIASLLAKSLAGSEKDDAISVCSQAIFLTSLIGAVLGIISIVFGQTFF 



Sequence description 

A] Length: 330 bp - 1 10 aa (partial sequence) 

B] Putative GTG start codon is preceded by a 
typical Shine-Dalgarno sequence. May have a 
leader peptide 

ID- 152 
Clone 2-c92 
(SEQ ID NO: 149) 

TTGATTAACAAGTATTCGTGCTTTTTGAAGAGGATTCTCCATAATAATACT 

CCTTTAATAGTTATCGTGAGAAGTATTTTAAAGAAAAACCGCCAAGGTAG 

AGCGACATTTCTGCCTTTAACTACAATAAAACCAAGAGAATTAGCACAAC 

ATTATCTCTCAAAATTACAAAGTTCTCAAGGGTTTTTAGGAATAGCTAGTG 

AATTGGTAACCTATGATCAACGCTTGTCAAACATTTTT 

(SEQ ID NO: 150) 

MINKYSCFLKRILHNNTPLIVIVRSILKKNRQGRATFLPLTTIKPRELAQHYLSK 
LQSSQGFLGIASELVTYDQRLSNIF 

Sequence description - - 



A] Length: 240 bp - 80 aa (partial sequence) 

B] No obvious Shine Dalgarno sequence precedes the Putative TTG start 
codon 



'I 

ID- 153 



5 Clone 2-c94 

(SEQ ID NO: 151) 

TTGTTGACTCACAAAAATATATTATTAACCATTATATTTGGATTATTTATGA 
TTATATTATCAGCATGTGGTATGTCTAATAAGGAAATGGCTGGTATTGATA 

1 0 ATTGGGA AC ATTATC AAAAGG A A A AG AAA ATT ACTATTGGATTTG ATA AT 
ACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCTTTG 
ATATTGATTTAGCTAATGCTGTTTTTAAAGAATACGGTATTTCAGTGAAAT 
GGCAGCCTATTAACTGGGATATGAAAGAAACTGAACTTAATAATGGTAAT 
ATAGACCTTATTTGGAATGGTTATTCAAAAACGGCAGAACGTGCTAAAAA 

1 5 AGTCGCTTTTAC A A ACCC AT ATATG AATAATC ATC AAGTAATTGTT ACT AA 
AACTTCATCACATATTAATAGTATTAAGGATATGAAGGGGAAAAAACTAG 
GAGCCCAGTCGGGTTCATCTGGTTTTGATGCTTTTAACGCTAAACCTGATA 
TTTTAAAAAAGTTTGT AAAAGG AAAAGAAGCAGTTCAATACGATACTTTC 
ACTCAGGCTTTGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATT 

20 GATGAAGTTTATGCTAACTATTATTTAAAGCAAGAAGGAA 

(SEQ ID NO: 152) 

MLTHKNILLTIIFGLFMIILSACGMSNKEMAGIDNWEHYQKEKKITIGFDNTFV 
PMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKETELNNGNIDLr 
25 WNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQSG 
SSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANY 
YLKQEG 



30 Sequence description 

A] Length: 649 bp - 216 aa (partial sequence) 

B] TTG start codon is preceded by a possible 
35 typical Shine-Dalgarno sequence. Has a 

leader peptide 



40 ID- 154 

Clone 2-clOO 



(SEQIDNO: 153) 

ATGAAAATTTGGAAAAAAATAACCTTAATGTTTTCTGCAATTATTTTAACA 

ACAGTAATTGCATTGGGAGTCTATGTTGCCTCAGCTTATAATTTTTCGACTA 

ATGAATTGTCTAAGACTTTT 

5 

(SEQ ID NO: 154) 

MKIWKKITLMFSAIILTTVIALGVYVASAYNFSTNELSKTF 



10 Sequence description 

A] Length: 123 bp - 41 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
1 5 typical Shine-Dalgarno sequence. Has a 

typical leader peptide 

ID-155 

20 

Clone 2-cl 

(SEQ ID NO: 155) 

ATGAAAAAACAAAGACTATTACTGCTTTTTGGAGGCTTATTAATAATGATA 
25 ATGATGACAGCATGTAAGGATTCAAAAATCCCAGAAAACCGCACGAAAAA 
GGAATACCAGGCAGAACAGAATTTTAAGTCATACTTTAAATATATATCAG 
ATAAAAATAACTATTTAGATAATATAAAAGTTTATTACTTTTCTATAAGTA 
TTTCTAAAGATGTACAAGATAAAGTCAGTGAAACAACAACTTGTTCATATA 
GACTAGAAAAGCAAAAGAATCAAGAGTTCATTGGTAATTTTGAACATGAA 
30 GTTAGTGAATCTAGTCAATATTCAACCGAAGTTAAAAATCAAATACAGTAT 
CCAATCCAGTATAAAGATAATTCAATTCGTTTTACTGAAAAAACACCGTCA 
GAACGTTATGATGAGTTTGTTTTTAGTTCATTTGATTCTTCATTATTAAAAA 
AATATAAAATATATGATTACTTACTAAAACATCCCGAAACTGAATTAAAA 
GGTGTTTCCTATAAGATTCCTATAAATTCTGAAATTGTAGCCCCTTTTATAA 
3 5 ATC A ATT AA AT AT A A A AA ATCCT A A A A A ATC ATCT ATTTCGGTT AC A A A A A 
CGGAAAGTAAAGAATATTATTATACAATCAGTATTGATACTGATTCTGAGA 
TATATTCTATATTCGAAGGTATTCAT 

(SEQ ID NO: 156) 

40 MKKQRLLLLFGGLLIMIMMTACKDSKIPENRTKKEYQAEQNFKSYFKYISDKN 
NYLDNIKVYYFSISISKDVQDKVSETTTCSYRLEKQKNQEFIGNFEHEVSESSQ 
YSTEVKNQIQYPIQYKDNSIRFTEKTPSERYDEFVFSSFDSSLLKKYKIYDYLLK 
HPETELKGVSYKIPINSETVAPFINQLNIKNPKKSSISVTKTESKEYYYTISIDTDS 
EIYSIFEGIH 



Sequence description 



5 

A] Length: 687 bp - 229 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Has a 
typical leader peptide. C-terminus has yet to be 

10 verified 



ID-156 
15 Clone 2-c5 

(SEQIDNO: 157) 

ATGACATTTGACACCATTGATCAATTAGCGGTTAATACAGTCCGCACGCTT 
TCTATTGATGCTATCCAAGCAGCAAATTCTGGGCACCCAGGTCTTCCTATG 
20 GGAGCTGCGCCTATGGCTTATGTGCTTTGGAATAAATTCTTAAATGTAAAC 
CCAAAAACAAGTCGCAATTGGACAAACCGTGACCGTTTTGTACTTTCAGCT 
GGGCATGGTTCAGCTCTTCTTTATAGCCTACTTCATTTAGCTGGCTATGATT 
TATCAATTGATGATTT 

25 (SEQ ID NO: 158) 

MTFDTIDQLAVNTVRTLSIDAIQAANSGHPGLPMGAAPMAYVLWNKFLNVNP 
KTSRNWTNRDRFVLSAGHGSALLYSLLHLAGYDLSEDD 



30 Sequence description 



A] Length: 272 bp - 90 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
35 typical Shine-Dalgarno sequence. No obvious 

leader peptide 

ID-157 

40 

Clone 2-c8 



(SEQIDNO: 159) 

ATGAGAACACTATTTAGAATGATATTTGCTATTCCAAAGTTTATCTTTAGA 
TTGATTTGGAATATCATTTGGGGAATATTCAAGACAGTTCTTGTTATTGCG 
ATTATTTTATTTGGCTTGTATTACTATGCGAATCACAGTCAATCAGAATTTG 
CTAATCAACTTAGTGACATTATTCAGACAGGAAAAACATTTTT 

(SEQIDNO: 160) 

MRTLFRMIFAIPKFIFRLIWNIIWGIFKTVLVIAIILFGLYYYANHSQSEFANQLS 
DIIQTGKTF 



Sequence description 

A] Length: 197 bp - 65 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 

ID- 158 
Clone 2-c9 
(SEQIDNO: 161) 

ATGTCAAAAAAAATAATATTAGGAATTTTATCTCTTTTATCTGTCGTTACTT 
TGGTGGCGTGTGGTTCATCAGACAAACAGCTACAAGATAAAGTTGAGAAA 
AAAGGGAAGTTAGTTTTAGCGGTGAGTCCAGATTATGCTCCCTTTGAGTTT 

(SEQIDNO: 162) 

MSKKIILGILSLLSVVTLVACGSSDKQLQDKVEKKGKLVLAVSPDYAPFEF 



Sequence description 

A] Length: 153 bp - 51 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide (not in frame with nuc) 



ID-159 



Clone 2-clO 



(SEQ ID NO: 163) 

5 ATGAAAAATCAAAGACTATTACTGCTTTTTGGAGGCTTATTAATAATGATA 
ATGATGACAGCATGTAAGGATTCAAAAATCCCAGAAAACCGCACGAAAAA 
GGAATACCAGGCAGAACAGAATTTTAAGTCATACTTT 

(SEQ ID NO: 164) 

1 0 MKNQRLLLLFGGLLIMIMMTACKDSKIPENRTKKE YQAEQNFKS YF 



Sequence description 

15 

A] Length: 139 bp - 46 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 

20 

ID- 160 
Clone 2-cll 

25 

(SEQ ID NO: 1 65) 

ATGATTGGAAAATTATATTATAGCTATAGAAAGTCACGCTTATTAAGAAGT 
ATTTTATGGCTTATTTTAATTGTTGGTGTATATATGTTAGGACAACGTGTTT 
TATTATCCACTGTTCCTTTATCACATCAAGAGATAAAACTAGCAGTAGATC 

30 AACATTTACTCAATAACTTTTCAGCAGTAAGTGGTGGGAGTTTTAATAAAT 
TAAATGTTTTCACACTGGGGTTGAGTCCATGGATGTCAAGTATGATTATTT 
GGAGATTCGTTTCCTTATTTTCGTGGGCAAAAAATGCAACGAAGCGAAAA 
GCAGAAGTAGCTCAATATACTTTAATGCTTACTATCTCAGTTATACAAGCA 
TATGGTGTTTCAGGAAATCAATTTATAAAAAGCTCTTTATTAGGTTCTTATA 

35 GTGATATTG IT IT 1 

(SEQ ID NO: 166) 

MIGKLYYSYRKSRLLRSILWLILIVGVYMLGQRVLLSTVPLSHQEIKLAVDQHL 
LNNFSAVSGGSFNKLNVFTLGLSPWMSSMIIWRFVSLFSWAKNATKRKAEVA 
40 QYTLMLTISVIQAYGVSGNQFIKSSLLGSYSDIVF 



Sequence description 



A] Length: 423 bp - 141 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 

ID-161 
Clone 2-cl3 
(SEQ ID NO: 167) 

ATGAAAGGTCTATTGGATTTTTTAGTTAATATTGCCAGAACGCCAGCTATT 

TTAGTCGCCTTGATAGCCATTATCGGTTTAGTACTGCAGAAAAAAGGTGTT 

CCTGATATTGTAAAAGGTGGAATAAAAACATTTGTTGGCTTCTTAGTGGTT 

TCTGAAGGTGCAGGGATAGTCCAAAATTCCTTGAATCCATTTGGAAAAATG 

TTTGAACATGCTTTTCATTTGGTGGGGGTAGTTCCTAATAATGAAGCCATT 

GTAGCAGTAGCTCTTACGAAGTATGGCTCAGCAACTGCTTTGATTATGTTA 

GCGGGAATGATTTTTAATATTTTAATTGCTCGTTTTACAAAA 

(SEQ ED NO: 168) 

MKGLLDFLVNLARTPAELVALIAIIGLVLQKKGVPDrVKGGIKTFVGFLVVSEG 
AGIVQNSLNPFGKMFEHAFHLVGVVPNNEAIVAVALTKYGSATALIMLAGMI 
FNILIARFTK 



Sequence description 

A] Length: 348 bp - 1 16 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
Shine-Dalgarno sequence. Possible leader 
peptide 

ED- 162 
Clone 2-c21 



(SEQ ED NO: 169) 

TTGGTTGGTAAGCCCCAATTACTATTTTTAGATGAACCTACTTCCGGAATG 
GATACTTCCACACGTCAACGATTTTGGAAGCTGGTTGCGACACTAAAAAA 
AGAAGGTGACACAATTGTCTATTCTAGTCATTATATCGAAGAGGTAGAAC 
ATACAGCTGATAGGATTTTAGTACTTCATAAAGGAAAGTTATTACGCGATA 



CAACCCCCTTTGCCATGAAGCAAGAAAAAACCGAAAAGTTATTCACCGTT 

CCGCTTAGTTATCAAAAATTATTACCTACCTATTTGATTACAGAGTGTGAA 

GCCAAGAGTGATAGTATAACGTTTGTTACTGGGGAGGCTGAAACTGTATG 

GAAAATACTGGCAGATAATGGTTGTCCTATTGAAGCTATTGAGATGACCA 

ATAGAACTTTGTTAAATCGTATTTTTGAGACTACTAAGGAGGTAAAACATG 

AGAATCTTTA 

(SEQ ID NO: 170) 

MVGKPQLLFLDEPTSGMDTSTRQRFWKLVATLKKEGDTIVYSSHYIEEVEHTA 

DRILVLHKGKLLRDTTPFAMKQEKTEKLFTVPLSYQKLLPTYLITECEAKSDSI 

TFVTGEAETVWKILADNGCPffiAffiMTNRTLLNRIFETTKEVKHENL 



Sequence description 



A] Length: 462 bp - 155 aa (partial sequence) 

B] B] Putative TTG start codon is not preceded by 
an obvious Shine-Dalgarno sequence. No obvious 
leader peptide. N- and C- termini require further 
examination. 



ID-163 
Clone 2-c25 
(SEQ ID NO: 171) 

TTGAAAAAATCCAAGAGAAGCCGTAAGGCAGTGACAACAAGTGGTGAGA 

AGACTTTACTTGAGGATTTGGCAAAAATGAATTTCCTAGACGAAGTCATTA 

ATGTTATGGTTTTATATACCTTGAATAAGACAAAATCTGCTAACTTAAATA 

AGGCCTATATCATGAAAGTTGCTAATGATTTTGCCTTTCAGAATGTTATGA 

CGGCCGAAGATGCTGTGCTTAAAATTCGTGATTTTTCAGATCAAAAAGTAA 

GGACTAAAACAGAAACGAAGAAGAAACAATCGAATGTTCCTGAATGGAGT 

AATCCTGATTATAAAGATGAGGTTAGCCCAGAAAAAGAAATTGAATTAGA 

ACAGTTT 

(SEQ ID NO: 172) 

MKKSKRSRKAVTTSGEKTLLEDLAKMNFLDEVINVM VLYTLNKTKS ANLNK 
AYIMKVANDFAFQNVMTAEDAVLKIRDFSDQKVRTKTETKKKQSNVPEWSN 
PDYKDEVSPEKEIELEQF 



Sequence description 



A] Length: 360 bp - 120 aa (partial sequence) 

B] N- and C- termini require verification. 



ID- 164 
Clone 2-c28 
(SEQ ID NO: 173) 

ATGACGAATCATATTACTAAACTGATAGAAAATAGCGGAAAAAAATTGAC 

AGAAATTAGCGAAGCTACAGATATAGCCTATCCTACACTTTCTGGATACAA 

TCAAGGAATCCGCAAACCTAAAAAAGATAATGCTGAAAAATTGGCAAAAT 

ACTTTAATGTTTCCGTCGCTTACATTATGGGACTTGATAGCAACCCACATG 

CTCCATCAAATCTT 

(SEQ ID NO: 174) 

MTNHITKLIENSGKKLTEISEATDIAYPTLSGYNQGIRKPKKDNAEKLAKYFNV 
SVAYIMGLDSNPHAPSNL 



Sequence description 

A] Length:218 bp - 72 aa (partial sequence) 

B] ATG start codon is preceded by an 
obvious Shine Dalgarno sequence. No obvious 
leader peptide. 

ID-165 
Clone 2-c29 
(SEQ ID NO: 175) 

TTGATGAAAAGGAATAAACATTTACCGTTAACAGAAACTACCTATTATATT 

TTATTAGCTTTGTTTGAGGAAGCGCATGGCTATGCTATTATGAAAAAAGTT 

GAAGAAATGAGTGGCGGTGATGTTAGAATAGCCGCAGGGACAATGTACGG 

TGCCATTGAAAATTTACTTAAAGAAAAATGGATAAAGTCTATCTCAAGTGA 

CGATAGAAGAAGAAAAGTTTATATTATTACTGAGACAGGAAAAGAAATAG 

TAGAACTTGAAACGAATCGATTAAGAAAGTTACTTAATACTGCTAATCAGT 

TGGGTTTTGGAGGAGATGGTTATGATAAAGTTT 



(SEQ ID NO: 176) 

MMKRNKHLPLTETTYYILLALFEEAHGYAIMKKVEEMSGGDVRIAAGTMYG 



AJENLLKQKWIKSISSDDRRRKVYIITETGKEIVELETNRLRKLLNTANQLGFG 
GDGYDKV 

5 

Sequence description 



10 A] Length: 337 bp - 1 12 aa (partial sequence) 

B] TTG start codon is preceded by an 
obvious Shine Dalgarno sequence. Actual start 
codon may ATG that comes immediately after the 
TTG. Potential leader peptide. 

15 

ID- 1 66 
Clone 2-c35 

20 

(SEQ ID NO: 177) 

CCCATTACTGGTGAGTTAATAGCTGAGAAATTAGGAGTACCAAGAGCAGC 
ACTAAGGTCTGATTTGCGGGTTTTAAGTATGCTAGGTATCATAGATGCAAA 
ACCTAAGGTTGGTTATTTTTATTTAGGACAGTATCATGCTTCAATAGGGAC 

25 AAGTCATTTTGAAAAGATGACAGTTTCAGAAATTATGGGGATCCTTCTGAC 
AGTTCATCAAAAAGATTCAGTTTATGATGTTATTGTACATATTTTTATGGA 
AGATGCTGGTTGTGCTTTTATCTTGGATGATGATGATTTTCTCTGTGGAGTC 
GTGTCACGTAAAGATTTACTAAAAACCAGTATTGGCGGAGGAGATCTTTCT 
AAAATGCCAATAGGAATGGTGATGACACGTATGCCACACGTGACAACTGT 

30 TTTAGAAAATGAAAGTCTTTTTGCGGCAGCTGATAAATTAGTGAGCAGAA 
AAGTGGATAGTCTCCCTGTCGTTCGTCATGATAAGCAATATCCCGAAAAAT 
TTA 

(SEQ ID NO: 178) 

35 PITGELIAEKLGVPRAALRSDLRVLSMLGIIDAKPKVGYFYLGQYHASIGTSHF 
EKMTVSEEVIGILLTVHQKDSVYDVIVHIFMEDAGCAFILDDDDFLCGVVSRKD 
LLKTSIGGGDLSKMPIGMVMTRMPHVTTVLENESLFAAADKLVSRKVDSLPV 
VRHDKQYPEKF 

40 

Sequence description 



A] Length:51 1 bp - 170 aa (partial sequence) 



B] N- and C-termini to be determined 



ID- 167 

5 

Clone 2-44 
(SEQIDNO: 179) 

TTGGAAGTCATCATGCAATTTATTTATAGTATTATTGGTATTTTATTGGTAT 
1 0 TAGGAATTGTGTATGC A ATTTCTTTC AATCGTAAGAGTGTTTCTCTA AGTTT 
AATTGGAAAAGCTCTTATCGTTCAATTCATTATTGCGCTAATCTTAGTACGT 
ATCCCACTAGGCCAACAAGTTGTTAGTGTTGTTTCAACTGGAGTTACTAAA 
GTAATCAACTGTGGTCAAGCTGGTTT 

15 (SEQIDNO: 180) 

MEVIMQFIYSIIGILLVLGIVYAISFNRKSVSLSLIGKALIVQFIIALILVRIPLGQQ 
VVSVVSTGVTKVINCGQAG 



20 Sequence description 

A] Length: 23 3 bp - 77 aa (partial sequence) 

B] TTG start codon is preceded by a 

25 possible Shine Dalgarno sequence. Actual start 

codon may occur further downstream. Potential 
leader peptide. 

30 DD-168 

Clone 2-46 
(SEQIDNO: 181) 

35 CAACCTAATAAAGCTTTAGAAAGTGATGAGATTGATATTAATGCTTTCCAG 
CATTATAATTACTTAACCAATTGGAATAAAGCAAATAAGACCAATCTTGTT 
TCCGTTGCTGAGACATACTTTACTTCCTTTAGATTATACTCTGGTACTAAGA 
ACGGTAAAGGTAAATACCAAACAGTTTCTGAAATTCCAAATAAAGCAACT 
ATTACTATCCCAAACGATGCAGTTAACGAAAGTCGCTCTCTCTACTTGTTA 

40 CAATCAGCAGGCTTGCTAAAATTGAAAGTATCAGGTGATACATTAGCAAC 
AATGTCAGATGTTGTTTCCAATCCTAAATCTTTAGATTT 



15 



30 



40 



(SEQIDNO: 182) 

QPNKALESDEroiNAFQHYNYLTNWNKANKTNLVSVAETYFTSFRLYSGTKN 
GKGKYQTVSEIPNKATITIPNDAVNESRSLYLLQSAGLLKLKVSGDTLATMSD 
VVSNPKSLD 



Sequence description 



10 A] Length:344 bp - 1 14 aa (partial sequence) 

B] N- and C- termini require verification 



ID- 169 
Clone 2-47 



(SEQ ID NO: 183) 

20 ATGAAATGTATAATAAATAATATAAATAAAATAAAAATGATAATTGAGAT 
TTATCATAGAAGGAAAACTATTTTGAAATTAAATAAAATCATATTATCTAC 
TGCAGCTCTTACTGCTCTCTTTTTAGGATATAATAGCGTTACTGCGGATACA 
TATAATAACTATCAGCCACATAGATCAAATAATATGGATTTAACTGAGGA 
ATATAACTATAATAACCAGATAGAACTTCAGGAGCGTATAAAAAACCTAA 

25 ATATACCTTTT 

(SEQIDNO: 184) 

MKCIINNINKIKMIIEIYHRRKTILKLNKIILSTAALTALFLGYNSVTADTYNNY 
QPHRSNNMDLTEEYNYNNQIELQERIKNLNIPF 



Sequence description 



35 A] Length:264 bp - 88 aa (partial sequence) 

B] There is a Shine-Dalgarno sequence upstream 
of this sequence. Potential leader peptide 
sequence 



ID- 170 



Clone RS-58b 



(SEQIDNO: 185) 

TTGGGTGATTATTATGGTAAGAAATATTTTGGTGAGGCAGCTAAAAAAGA 
CGTCGAACATATGGCTAAGAAAATCATTAATGTCTATAAAACACGGTTAA 
AAAACAACACTTGGTTATCAGAAAATACAAAAGCAATGGCCATTAAGAAA 
5 CTTGATAACATGAGATTAATGATTGGCTATCCAGAAGATTATCCTGATCTT 
TATCGTCAGTACCAATTTGATAGTAAAGCAAGCTTCTTTGAAAACAATGAT 
AACTACAGAAAATTATCGAACAAGAAAACATTTGAAGAATTTAACCAGTC 
TAATCAACGTGAACATTGGCAAATGAGTGCCAATGCTGTAAATGCTTATAA 
TGATCCTAATACCAATTCCATAGTCTTTCCAGCAGCGATTTTTCAATCACCA 

1 0 CTGT ACG AT A A A ACT A A A AC AGTTAGTC A A A ATT ATGG AGCT ATCGG AGC 
AATTATTGGTCATGAAATTTCACACTCATTTGATATTAATGGTATGAAATA 
TGACGAGAAAGGGAATCTTCACGATTGGTGGACTAAAGAAGATTTAAATC 
ATTATAAGAAATCAACACAAGCTATGATTGACCAATGGGATGGCCTTAAA 
GCAGATGGCGGTAAAGTTGATGGTAAATTAACTTTAGCAGAAAATATTGC 

1 5 AG AT A ATGGTGGTGTT ATGGCATCTCT AG A AGCTCTT AAG ACTG A A A AAAT 
CCAAACTATAAAGAATTTTTTGAATCATGGGCAAGTATTTGGCGTCAAAAA 
GCAACCAAAGAACAAAGTAAGTCCTCAATTCAGTCAGATGTTCATGCACC 
ATATGAATTGAGAGCTAACATCCCAGTACGTAATTTCCAAGAATTTTATGA 
TGCCTTTGGTGTTAAAAAAGGCGATTCAATGTATCTAAAACCAGAAAAAC 

20 GTTTGACACTTTGGTAA 

(SEQIDNO: 186) 

MGDYYGKKYFGEAAKKDVEHMAKKIINVYKTRLKNNTWLSENTKAMAIKK 
LDNMRLMIGYPDYPDLYRQYQFDSKASFFENNDNYRKLSNKKTFEEFNQSNQ 
25 REHWQMSANAVNAYNDPNTNSIVFPAAIFQSPLYDKTKTVSQNYGAIGAIIGH 
EISHSFDINGMKYDEKGNLHDWWTKEDLNHYKKSTQAMIDQWDGLKADGG 
KVDGKLTLAENIADNGGVMASLEALKTEKIQTIKNFLNHGQVFGVKKQPKNK 
VSPQFSQMFMHHMN* 

30 Sequence description: 

A] Length: 819 bp - 272 aa (full length gene) 

(107 bp of additional DNA sequence (> onwards) is 
also included. While not in-frame with the 
35 described orf, it also shares strong homology 

with the neutral peptidases. 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-89 gene which was identified by LEEP, 

during cloning and sequence analysis of the full-length ID-89 gene sequence. -- — 

40 ID-89 and ID- 170 together show homology over their combined entire length 

with the neutral endopeptidases from Lactococcus and Lactobacillus. Possesses 
TTG (possible ATG start codon located 1 3 bp further downstream) start 
codon with no obvious signal peptide. Shine Dalgarno sequence not 
immediately obvious. Possibly located further downstream 



ID-171 



5 Clone 2- 1 8/22b (Mod2) 
(SEQ ID NO: 187) 

ATGACCATGATTACGCCAAGCTTCATTAAGGTATCTGTAGATGAAACAAAT 
CGTATGATGCGTATGATATCAGATTTATTAAGTTTATCGCGCATTGATAAT 

1 0 GAAGT AACGC ATTT AG ATGTTG A A ATG ACG AATTTT AC AGCTTTC ATG ACC 
TCAATTTTGAATCGATTTGATCAGATTAGAAATCAAAAAACAGTCACAGG 
AAAAGTTTATGAAATTGTCAGAGATTATCCTCTTAAGTCAATTTGGGTGGA 
AATTGATACAGATAAGATGACTCAAGTGATTGATAACATTTTAAATAATGC 
AGTCAAGTATTCACCAGATGGTGGTAAGATTACAGTTAATCTACGCACAAC 

1 5 TAAAACGC AG ATGATTTTATC A AT ATC AG ACC A AGGCTTAGGTATTCCC A A 
AAAAGATTTACCTCTCATTTTTGATCGTTTTTATCGTGTTGATAAGGCGAGA 
AGTCGTCAACAGGGTGGGACTGGACTTGGTTTGTCAATTGCAAAAGAAAT 
TGTTAAGCAGCATAAGGGATTTATTTGGGCTAAGAGTGAGTATGGTAAAG 
GGTCTACTTTTACAATCGTCTTGCCTTATGATAAAGATGCTGTAACTTATGA 

20 AGAATGGGAGGACGTTGAAGATTAA 

(SEQ ID NO: 188) 

MTMITPSFIKVSLDETNRMMRMISDLLSLSRIDNEVTHLDVEMTNFTAFMTSIL 
NRFDQIRNQKTVTGKVYEIVRDYPLKSIWVEIDTDKMTQVIDNILNNAVKYSP 
25 DGGKITVNLRTTKTQMILSISDQGLGIPKKDLPLIFDRFYRVDKARSRQQGGTG 
LGLSIAKEIVKQHKGFIWAKSEYGKGSTFTIVLPYDKDAVTYEEWEDVED* 



Sequence description: 

30 

A] Length: 613 bp - 212 aa (full-length gene possibly) 

B] Possible Shine Dalgarno sequence present 
upstream of a ATG start codon. May not have yet 
determined the N- portion of this gene. No 

35 obvious signal peptide. 



ID-172 

40 Clone 2-54balternate (107b) 
(SEQ ID NO: 189) 

TTGAAAAAAATTATTACTTCTATTCTATTACTTAGTTGCATTTTTTTTATGC 
CAACCATCTCTGCTGAATCTTTTAATGCTTCCGCTAAACATGCCTTAGCAGT 



TGATTTAGATTCAGGAAAAATCTTGTATGAAAAAGATGCTAACAAACCCG 

CTGCTATTGCTTCCTTGACTAAAATAATGACCGTTTATATGGTCTATAAAG 

AAATTGATAACGGTAACCTCAAGTGGAATACCAAAGTAAATATATCTGAC 

TACCCTTATCAACTAACACGCGAATCTGATGCTAGTAATGTTCCTTTAGAA 

AAAAGGCGCTATACTGTTAAACAACTCGTGGACGCTGCCATGATTTCTAGT 

GCTAACAGTGCAGCCATTGCTTTAGCTGAACATATTTCAGGAACTGAAAGT 

AAATTTGTTGATAAAATGACTGCTCAATTGGAAAAGTGGGGAATTCATGAT 

AGCCACCTAGTCAATGCTTCTGGCTTAAATAATAGTATGTTAGGCAATCAC 

ATTTATCCAAAATCGTCACAAAACGACGAAAATAAAATGAGTGCACGTGA 

TATTGCTATTGCTGCCTACCATTTGGTCAACGAATATCCTTCCATTCTTAAG 

ATTACTAGTAAGTCCGTTGCTAAATTTGATAAAGATATTATGCATTCTTAT 

AACTACATGCTACCAGATATGCCTGTCTTTAGACCAGGTATTACAGGTTTG 

AAAACTGGGACAACGGAATTAGCTGGCCAATCTTTTATTGCTACATCTACT 

GAAAGTGGAATGAGACTACTCACTGTTATTATGCATGCTGATAAGGCCGAT 

AAAGACAAATATGCTCGCTTTACAGCAACTAACTCTCTCTTGAACTATATC 

ACAAACACCTACGAACCTAACCTTGTATTAGCTAAAGGAGCTGCATATAA 

AGGTAAAGAAGCAAGTGTGAGAGACGGAAAAGAACAATCGGTCATCGCT 

GTTGCTAAAAACGATTTGAAAGTAGTACAGAAGAAAAATATCACTAAACA 

AAATCAGTTAAAAATTAACTTTAAAAAAGAGCTTACTGCTCCTATTACAAA 

AAAAGAGAACCTAGGGAAAGCTTATTACGTTGACCTTAATAAGGTTGGAA 

AAGGCTATCTCATAAAGGAACCTAGCGTTCATTTAGTGGCAAAAGATAGT 

ATTGAGCGCAGTTTCTTCCTCAAAGTGTGGTGGAATCATTTTGTGCGCTAC 

GTTAACGAAAAACTTTAA 

(SEQ ID NO: 190) 

MKKIITSILLLSCIFFMPTISAESFNASAKHALAVDLDSGKILYEKDANKPAAIA 

SLTKIMTVYMVYKEIDNGNLKWNTKVNISDYPYQLTRESDASNVPLEKRRYT 

VKQLVDAAMISSANSAAIALAEHISGTESKFVDKMTAQLEKWGIHDSHLVNA 

SGLNNSMLGNHIYPKSSQNDENKMSARDIAIAAYHLVNEYPSILKITSKSVAKF 

DKDrMHSYNYMLPDMPVFRPGITGLKTGTTELAGQSFIATSTESGMRLLTVIM 

HADKADKDKYARFTATNSLLNYITNTYEPNLVLAKGAAYKGKEASVRDGKE 

QSVIAVAKNDLKVVQKKNITKQNQLKINFKKELTAPITKKENLGKAYYVDLN 

KVGKGYLIKEPSVHLVAKDSffiRSFFLKVWWNHFVRYVNEKL* 

Sequence description: 

A] Length: 1236 bp - 412 aa (full-length gene sequence possibly) 

B] A possible Shine-Dalgarno sequence precedes the putative 'TTG' start 
codon. (needs further cloning and sequencing to verify N-terminus) 

ID-173 



Clone 3-60b 



(SEQIDNO: 191) 

ATGACGCTTCGAGAATTAACAATAGAAGAATTTAAAGAACATTCAGGAAA 
TTATGATTCACAATCATTTTTACAAACACCTGAGATGGCTAAACTTTTAGA 
5 AAAACGCGGCTATGATGTTAGGTATTTGGGATATCAAGTAGAAAATAAAC 
TAGAGATAATCAGTTTATCTTATATTATGCCAGTCACTGGTGGTTTTCAAAT 
GAAAATTGATTCAGGACCAGTTCATTCAAATTCTAAGTATCTAAAACAATT 
TTATAAAGCATTGCAAGGCTATGCCAAATCCAACGGTGTTCTAGAATTAAT 
AGTTGAGCCTTTTGATGATTACCAATTATTCACTAGTTCGGGAGTTCCTAGT 

1 0 AATC AGGG A A ATG ATA ATCTGATTG A AG ATTTT ACC AGTTC AGGTT ATC AC 
CATGATGGTTTAACAACTGGTTTTACTGGTAAATATTTATCTTGGCACTATG 
TTAAAAATTTAGAAGGTGTCACTTCTGAAACGTTACTATCTTCATTCTCTAA 
GACAGGACGAGCTTTGGTTAAGAAAGCAATGTCTTTTGGAATCAAGGTTC 
GCGTTCTTAAACGTGATGAGCTACATTTATTTAAAGAGATAACAACTTCTA 

1 5 CGTCA AATAGACGTGATT AT ATGG ATA AGTCCTTAGATTATT ATC AAG ATT 
TTTACGATAGCTTTGAAGGCAAGGCTGAATTTGTGATTGCCACTTTAAATT 
TTAGAGAATACGACCATAACTTGCAAATAAAAGCTGAAGCATTGGAAAAT 
AAGCTT 

20 

(SEQ ID NO: 192) 

MTLRELTIEEFKEHSGNYDSQSFLQTPEMAKLLEKRGYDVRYLGYQVENKLEI 
ISLSYIMPVTGGFQMKIDSGPVHSNSKYLKQFYKALQGYAKSNGVLELIVEPF 
DDYQLFTSSGVPSNQGNDNLffiDFTSSGYHHDGLTTGFTGKYLSWHYVKNLE 
25 GVTSETLLSSFSKTGRALVKKAMSFGIKVRVLKRDELHLFKEITTSTSNRRDY 
MDKSLDYYQDFYDSFEGKAEFVIATLNFREYDHNLQIKAEALENKL 



Sequence description 

30 A) Length: 771 bp - 257 aa (partial gene sequence) 

B) This gene sequence was not identified using the LEEP system. It was 
identified immediately downstream of the ID-65 gene which was identified by 
LEEP, during cloning and sequence analysis of the full-length ED-65 gene 
sequence. Sequence Characteristics: 

35 No obvious leader peptide sequence 

Orf is preceded by a potential Shine- 
Dalgarno sequence. 

40 ID- 174 

Clone 2- 17b (ID-80b) 



(SEQ ID NO: 193) 



TTGTCATTAAGTTTGGTTGCAGTGTTAAATCTTATCCCTCCTAAAATCATGG 

GATCAGTTATTGATGCTATTACAACTGGAAAATTAACAAGACCACAATTAC 

TATGGAATTTATTAGGTTTGGTTTTGTCAGCTTTAGCTATGTATGGGCTGCG 

TTATATTTGGCGTATGTATATTTTAGGGACTTCTTACAAATTAGGCCAAGTT 

GTCAGATACCGTTTATTTGAACATTTTACAAAAATGTCTCCTTCTTTTTATC 

AGAAATATCGTACAGGTGATTTAATGGCGCACGCGACCAACGACATCAAT 

TCTCTAACACGTCTTGCAGGAGGAGGAGTTATGTCAGCAGTGGATGCCTCT 

ATCACAGCATTAGTAACGCTTATCACCATGTTCTTTACTATTTCGTGGCAA 

ATGACATTAATTGCGGTTATCCCTTTGCCCTTAATGGCCTTAGCACTAGTA 

AATTGGGGCGAAAAACCCATGAAACCTTCAAAGAATCTCAGGCAGCCCTT 

TTCAGAATTAAATAATAAAGTG 

(SEQ ID NO: 194) 

MSLSLVAVLNLIPPKIMGSVIDAITTGKLTRPQLLWNLLGLVLSALAMYGLRYI 
WRMYILGTSYKLGQVVRYRLFEHFTKMSPSFYQKYRTGDLMAHATNDINSLT 
RLAGGGVMSAVDASITALVTLITMFFTISWQMTLIAVIPLPLMALALVNWGEK 
PMKPSKNLRQPFSELNNKV 

Sequence description 

A) Length: 534 bp - 178 aa (partial gene 
sequence) 

B) This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-80 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-80 gene sequence. 
Sequence Characteristics: 

No obvious leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgarno sequence. 

ID-175 

Clone 2-llAb (ID- 103b) 
(SEQ ID NO: 195) 

ATGCATATTGAGACTGTTATTGATTTCAAAGAATTAGGAAAAAGATATCGT 

TTTAAAAATCCTACAAAAGAATTAATAGCTGATACTTTAGAACAAGTCTTA 

GAAGTGATAAAAGAAGTTGATTATTATCAATCTGAAAATTATTATGTTGTT 

GGTTATTTATCTTATGAAGCATCTGCTGCTTTTGATTCACATTTTAAAGTTT 

CTCAACAGAAGTTGGCTGGAGAACATCTAGCTTATTTTACAGTACATAAAG 

ATTGTGAGAACGAAGCTTTTCCTTTAAGTTATGAAAATGTTAGATTAGCAG 

ATAATTGGACTGCTAATGTTTCTGAGCAAGAATATCAAGAGGCAATTGCTA 



ATATTAAAGGACAAATTAGACAAGGAAATACTTATCAAGTAAATTATACA 
CTAGAGCTTAGCCAACAATTATGCTCGGATCC 

(SEQ ID NO: 196) 

MHIETVIDFKELGKRYRFKNPTKELIADTLEQVLEVIKEVDYYQSQNYYVVGY 
LSYEASAAFDSHFKVSQQKLAGEHLAYFTVHKDCENEAFPLSYENVRLADNW 
TANVSEQEYQEAIANIKGQIRQGNTYQVNYTLELSQQLCSD 



Sequence description: 

A] Length: 440 bp - 146 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 103 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 103 gene sequence. 
Shine Dalgarno sequence present upstream of 

ATG start codon, No apparent leader peptide sequence 



ID-176 

Clone 2- 1 8/22b(b) (ID- 1 04b) 
(SEQ ID NO: 197) 

GTGAATAATATGTTTTATCTCAAAATAGCCTGGCATAATTTAAAACATTCT 

ATAGACCAGTACATACCATTCCTCTTAGCCAGTTTATTACTTTATTCATTGA 

CTTGTTCTACGCTACTAATCTTAATGAGTGCTGTTGGAAGAGATATGGGGA 

CAGCGGCAACGGTTCTTTTTCTTGGAGTGATTGTTTTGTCAATCTTTGCGGT 

AGTCATGGAACATTATAGCTACAATATCTTGATGAAACAGCGTAGTAGTG 

AATTTGGACTGTATAACATTTTGGGGATGAATAAACGTCAAGTTGCGCGTG 

TAGCTAGTCTAGAGCTGTTTATTATTTATATATTTCTTATTTCTATAGGAAG 

TCTGTTTAGTGCTTTTTTTGCTAAATTTATTTATTTAATTTTTGTCAACATTA 

TTAACTATCATGCACTAAATCTTAGTTTAAGTTTATGGCCATTTATTATTTG 

TATCGTTATATTTACAGGTATTTTTCTGACTTTAGAAGTTCCAGTTATTCGA 

CATGTTCATTTATCATCCCCATTAAGTCTTTTTAGAAAGAAACAACAGGGA 

GAAAAAGAACCAAAAGGTAATCTTATACTTGCAATTTTAGCGTTAGTAGCT 

ATCGCCATCGCTTATACAATGGCTCTTACTTCAGGTAAAGCACCTGCATTA 

GCTGTTATCTATCGTTTCTTCTTTGCAGTACTTTTAGTAATTGCTGGTACTT 

ATCTTTTTTATATTAGTTTTATGAGATGGTAGTTAAAAAGGTTGGGTGAAAA 

CAAGCATTATTATTATAAATCTGAGCATTTTGTATCAACTTCGCAAATGAT 

TTTTCGAATGAAGCAAAATGCAGTAGGGTTAGCAAGTATCACTTTATTAGC 

TGTTATGGCTCTAGTTACTATTGCTACAACAGTCTCACTCTATTCAAATACA 

CAAAATGTTGTTACCGGACTATTTCCAAAATCAGTAAGTTTATCAATAGAT 

AATTCAAAAGGTGACGCGAAAAATATATTTGAAGAAAAGATTTTGAAGAA 



ACTAGGTAAGTCATCTAAGGAAGCTATCACTTATAATCAGACAATGATTTC 
GATGCCAGTTAGTCAATCAAGTGACTTAATATCACATCTA 

(SEQ ID NO: 198) 

MNNMFYLKIAWHNLKHSIDQYIPFLLASLLLYSLTCSTLLILMSAVGRDMGTA 

ATVLFLGVIVLSIFAVVMEHYSYNILMKQRSSEFGLYNILGMNKRQVARVASL 

ELFUYIFLISIGSLFSAFFAKFIYLIFVNIINYHALNLSLSLWPFnCIVIFTGIFLTLE 

VPVIRHVHLSSPLSLFRKKQQGEKEPKGNLILAILALVAIAIAYTMALTSGKAP 

ALAVIYRFFFAVLLVIAGTYLFYISFMTWYLKRLRQNKHYYYKSEHFVSTSQM 

IFRMKQNAVGLASITLLAVMALVTIATTVSLYSNTQNVVTGLFPKSVSLSIDNS 

KGDAKNIFEEKILKKLGKSSKEAITYNQTMISMPVSQSSDLISHL 



Sequence description: 

A] Length: 1 1 19 bp - 373 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 104 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID- 104 gene sequence. 
Possible Shine Dalgarno sequence present 

upstream of a GTG start codon. Possesses a potential 
leader peptide sequence 

ID-177 

Clone 2-5b (ID- 112b) 
(SEQ ID NO: 199) 

ATGGTTGAGCCAATTATTTCAATACAAGGACTTCATAAAAGTTTTGGGAAA 

AATGAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAAGTGGT 

GGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAAT 

GAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGATTG 

ATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGC 

ATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAAT 

ATTACTTTATCACCTATTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAG 

ACAAAAGCATACGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAGGCTAA 

TGCTTATCCAGCAAGCTTATCTGGAGGACAACAACAACGGATTGCTATTGC 

AAGAGGTCTTGGAATGAATGGTGATGTCCTTCTTTTTGATGAAGGTACTTCA 

GCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTA 

GCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCA 

CGTGAAGTAGCGGATCGTGTCATTtTTATGGATGCAGGGATTATTGTTGAG 

CAAGGGACCCCTAAGAAAGTATTTGAGCAGACAAAAGAAATCCGCACAAG 

AGACTTCTTAAGTAAAGTATTATAA 



(SEQ ID NO: 200) 

MVEPDSIQGLHKSFGKNEVLKGIDLDfflQGEVVVIIGPSGSGKSTFLRTMNLLE 
VPTKGTVTFEGIDITDKKNDIFKMREKMGMVFQQFNLFPNMTVLENITLSPIKT 
5 KGLSKLDAQTKAYELLEKVGLKEKANAYPASLSGGQQQRIAIARGLAMNPDV 
LLFDEPTSALDPEMVGEVLTVMQDLAKSGMTMVIVTHEMGFAREVADRVIF 
MD AGII VEQGTPKK VFEQTKEIRTRDFLS K VL* 



10 Sequence description: 

A] Length: 735 bp - 244 aa (full length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-112 gene which was identified by LEEP, 

15 during cloning and sequence analysis of the full-length ID-1 12 gene sequence. 

Shine-Dalgarno sequence precedes the 'ATG' 
start codon. No obvious leader peptide 

20 ID-178 

Clone 2-5c (ID-1 12c) 
(SEQ ID NO: 201) 

25 ATGTCTCAsTATCAAGAGTGGTTAGAAAACGACTCACTCGGTAAAGATATT 

AAGTCAGATTTAGAAGCTATTAAAGGAGATGAATCTGAAATTCAGGATCG 
TTTTTACAAAACATTAGAATTTGGAACGGCGGGATTGAGAGGTAAACTTG 
GAGCAGGAACCAATCGTATGAATACTTATATGGTGGGGAAAGCAGCACAA 
GCATTAGCTAATCGATTATTGATCATGGCCCTGAAGCTATTGCACGTGGAA 
30 TTGCAGTTAGTTATGATGTCCCGTTATCAATCTAAGGAATTTGCAGAATTA 
ACTTGGTCCATTATGGCAGCAAATGGTATTAAAGCCTTATATTTA 

(SEQ ID NO: 202) 

MSHMNYKEIYQEWLENDSLGKDIKSDLEAIKGDESEIQDRFYKTLEFGTAGLR 
35 GKLGAGTNRMNTYMVGKAAQALANRLLIMALKLLHVELQLVMMSRYQSKE 
FAELTWSIMAANGIKALYL 



Sequence description: - — - - 

40 

A] Length: 366 bp - 122 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-1 12 gene which was identified by LEEP, 



during cloning and sequence analysis of the full-length ID-1 12 gene sequence. 

Shine-Dai garno sequence preceded the 'ATG' 

start codon. No obvious potential leader peptide sequence. 

5 

ID- 179 

Clone 2-5d (ID-1 1 2d) 

10 (SEQ ID NO: 203) 

ATGCAACCTGTAAAAGTCGATGAACCTTCTGTTGAAGAAACCATTACTATT 
TTGAAAGGTATCCAAAAAAAATACGAAGATTATCATCACGTAAAATATAA 
TAATGATGCCATAGAAGCAGCTGCAGTACTATCTAATCGTTATATCCAAGA 
CCGCTTTTTACCTGATAAAGCAATAGACTTATTAGATGAAGCTGGTTCTAA 

1 5 AATG AACCTA AC ACTAAATTTTGTTGATCC A AA AG A A ATTG ATC AACGTCT 
CATTGAAGCAGAAAATTTAAAAGCGCAAGCGACTCGTGAAGAAGATTACG 
AACGTGCAGCTTACTTCCGTGACCAGATTGCAAAATATAAAGAAATGCAG 
CAACAAAAGGTCGACGATCAAGATACACCTATTATTACCGAAAAAACAAT 
TGAGCACATCATTGAAGAAAAAACGAATATCCCTGTTGGTGATTTAAAAG 

20 AAAAAGAACAATCTCAATTAATTAATCTCGCAGATGACTTGAAACAGCAT 
GTGATCGGCCAGGATGACGCTGTCATTAAGATTGCAAAAGCTATTCGTCGT 
AATCGAGTTGGTCTTGGTAGCCCAAACCGTCCTATTGGTTCCTTTTTATTTG 
TAGGACCAACCGGTGTTGGTAAAACTGAACTTTCTAAACAACTAGCAATTG 
AGCTCTTTGGTTCAGCTGATAGTATGATTCGTTTTGATATGTCAGAGTACAT 

25 GGAAAAGCATGCTGTTGCTAAATTAGTCGGAGCGCCTCCAGGATACGTGG 
GATACGAGGAAGCTGGACAACTAACTGAAAAGGTTCGTCGAAATCCTTAC 
TCGCTCATCCTTCTAGATGAAATTGAAAAAGCTCATCCCGATGTCATGCAT 
ATGTTCTTGC AGGTCCTTG ATG ACGGTCG ATT A AC AG ATGG AC A AGG A AG 
AACTGTTAGTTTTAAAGATACCATTATCATCATGACCTCAAATGCTGGTTC 

30 TGGTAAAACTGAAGCAAGTGTTGGCTTTGGTGCCTCACGAGAAGGTAGGA 
CGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCAT 
GCAAGC 

(SEQ ID NO: 204) 

35 MQPVKVDEPSVEETITILKGIQKKYEDYHHVKYNNDAIEAAAVLSNRYIQDRF 
LPDKAIDLLDEAGSKMNLTLNFVDPKEIDQRLIEAENLKAQATREEDYERAAY 
FRDQIAKYKEMQQQKVDDQDTPIITEKTIEHIIEEKTNIPVGDLKEKEQSQLINL 
ADDLKQHVIGQDDAVIKIAKAIRRNRVGLGSPNRPIGSFLFVGPTGVGKTELSK 
QLAIELFGSADSMIRFDMSEYMEKHAVAKLVGAPPGYVGYEEAGQLTEKVRR 

40 NPYSLILLDETEKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDTIIIMTSNAGS 
GKTEASVGFGASREGRTNSSSVPGDPLESTCRHAS 



Sequence description: 



A] Length: 1070 bp y 356 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 112 gene which was identified by LEEP, during 

5 cloning and sequence analysis of the full-length ID-1 12 gene sequence. Shine- 

Dalgarno sequence preceded the 'ATG' 
start codon. No obvious potential leader peptide 
sequence. 

10 

ID- 180 

Clone 2-7b (ID-1 13b) 

15 (SEQ ID NO: 205) 

ATGAGAGGGAAGGTTATTTACGGCACAACCCTTATAGGTCTTTTTCTATTC 
TTATTTTTCTATTTTTGGATTCCTAAGCATCACATCGAGAGAATACATCATC 
ATCGTATAAAGCAGGTAGATGCGAAGAGTGATTTAACAGGATTTAAAACC 
CATTTGCCCATTATCAGCATTGATACAAAGCAACAAGTTATTCCTCTTGTT 

20 ACAAAAGAAGGCGGAAAATATGTCAAAGCTAGGGATAATATTAATGTTGA 
TATCGAATTACGGGATTCTCCAAGTAGATCACATCATTTATCAGAAAAGCC 
GAGAATTAGGACAAAAGGGTTAATATCATATAGAGGAAATTCCTCTCGTT 
ACTTTGATAAGAAGTCATTGAAAGTTAAGTTTGTTACTAATAAGTTAAAGG 
AAAAGAAGCATCGATTAGCAGGAATGCCTAAAGAATCGGAGTGGGTATTG 

25 CATGGTCCCTTTCTAGACAGAACATTATTAAGAAATTATCTGAGTTATAAT 
ATTGCTGGTGAGATTATGCCTATGCCCCAAACGTTCGCTACTGTGAGTTAT 
TTGTCAATGGTGAGTATCAGGGAG 

(SEQ ID NO: 206) 

30 MRGKVIYGTTLIGLFLFLFFYFWIPKHHIERIHHHRIKQVDAKSDLTGFKTHLPII 
SIDTKQQVIPLVTKEGGKYVKARDNINVDIELRDSPSRSHHLSEKPRIRTKGLIS 
YRGNSSRYTOKKSLKVKFVTNKLKEKKHRLAGMPKESEWVLHGPFLDRTLLR 
NYLSYNIAGEIMPMPQTFATVSYLSMVSIRE 

35 

Sequence description: 

A] Length: 582 bp - 194 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It- was 

40 identified downstream of the ID-1 13 gene which was identified by LEEP, 

during cloning and sequence analysis of the full-length ID-1 13 gene sequence. 

ATG start codon is preceded by a Shine- 

Dalgarno sequence-Possesses a potential leader peptide 

sequence. C-terminus to be determined. 



ID-181 



Clone 2- 17b (ID- 117b) 
(SEQ ID NO: 207) 

CTTCACATTTTATTGATCACTATCTGACAAATGTTAATCAAACAGCAGTTCT 

TATTTTAGTGGGATATTATTCAATGTATGTCTTGCAGACCTTAATTCAATAT 

TTTGGGAATCTCTTTTTTGCGCGTGTTTCTTATAGTATTGTTAGAGATATTC 

GTAGAGATGCTTTTGCTAATATGGAAAGGCTAGGCATGTCTTATTTTGATA 

GGACACCGGCAGGATCTATTGTGTCACGTATTACTAATGATACTGAAGCAA 

TATCTGATATGTTTTCGGGTATTTTATCAAGTTTTATCTCGGCGATATTTAT 

TTTTACAGTTACTCTGTACACTATGTTGATGCTAGACATTAAACTAACAGG 

ACTCGTCGCTCTTTTGTTACCTGTTATCTTTATATTAGTGAATGTCTATCGG 

AAAAAATCAGTCACTGTCATTGCTAAAACGAGAAGTTTACTTAGTGATATC 

AACAGTAAATTATCAGAAAGTATTGAAGGAATTC 

(SEQ ID NO: 208) 

SHFIDHYLTNVNQTAVLELVGYYSMYVLQTLIQYFGNLFFARVSYSIVRDIRRD 
AFANMERLGMSYFDRTPAGSIVSRITNDTEAISDMFSGILSSFISAIFIFTVTLYT 

MLMLDIKLTGLVALLLPVIFILVNVYRKKSVTVIAKTRSLLSDINSKLSESIEGI 



Sequence description: 

A] Length: 498 bp - 165 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 117 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 117 gene sequence. 
N- and C-termini have yet to be determined 

ID- 182 

Clone 3-8b (ID- 120b) 
(SEQ ID NO: 209) 

ATGTACCATA'ITGAATTAAAAAAGGAAGCTTTAGTACCAAGAGAACGCCT 

AGTTGATTTAGGCGCAGATAGATTGAGTAATCAGGAGTTATTAGCCATTCT 

CTTACGTACAGGTATTAAAGAAAAACCTGTTCTTGAAATTTCAACGCAAAT 

TTTAGAAAACATAAGCAGTTTAGCAGATTTTGGTCAATTATCCTTACAGGA 

GTTGCAATCCATTAAAGGAATCGGTCAGGTTAAATCCGTCGAAATAAAAG 

CTATGCTAGAACTAGCAAAACGGATTCACAAAGCTGAATATGATCGTAAA 



GAGCAAATTTTAAGTAGTGAACAATTAGCGAGGAAAATGATGCTCGAATT 

AGGGGATAAAAAACAAGAACATTTAGTAGCTATTTATATGGATACACAAA 

ATCGTATTATCGAACAGAGAACTATTTTTATTGGTACTGTACGTCGTTCAG 

TAGCAGAGCCAAGAGAAATTCTACATTATGCTTGTAAAAACATGGCAACT 

TCTTTGATTATTATACATAATCATCCCTCAGGTTCTCCAAATCCCAGTGAAA 

GTGATTTAAGTTTCACTAAAAAAATAAAACGATCATGTGATCATCTGGGAA 

TTGTCTGCCTAGATCACATCATCGTTGGAAAAAATAAATATTATAGTTTTC 

GAGAAGAAGCAGATATTTTATAA 

(SEQ ID NO: 210) 

MYHIELKKEALLPRERLVDLGADRLSNQELLAILLRTGIKEKPVLEISTQILENI 

SSLADFGQLSLQELQSIKGIGQVKSVEIKAMLELAKRIHKAEYDRKEQILSSEQ 

LARKMMLELGDKKQEHLVAIYMDTQNRIEEQRTIFIGTVRRSVAEPREILHYAC 

KNMATSLIIIHNHPSGSPNPSESDLSFTKKIKRSCDHLGrVCLDHUVGKNKYYSF 

REEADIL* 



Sequence description: 

A] Length: 681 bp - 227 aa (full-length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 120 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 120 gene sequence. 
ATG start codon is preceded by an typical 

Shine-Dalgarno sequence. No obvious leader 
peptide sequence 

ID-183 

Clone 3-1 lb (ID- 121b) ^ 
(SEQ ID NO: 211) 

TGGTTAAAAGTAGTGATAGCTTGTATTCCATCTATTTTAATTGCTTTACCAT 

TTGATAATTGGTTTGAAGCTCATTTTAATTTCATGATTCCGATTGCAATAGC 

CCTAATCTTTTATGGTTTTGTCTTCATATGGGTTGAAAAACGTAATGCACAC 

CTCAAACCACAGGTAACCGAATTGGCAAGTATGTCTTACAAGACAGCTTTC 

TTGATTGGATGTTTCCAGGTTCTCAGTATTGTTCCGGGAACCAGTCGTTCTG 

GAGCTACTATTTTAGGAGCAATTATTAITGGAACTAGTCGTTCGGTGGGTG 

CTGACTTTACTTTCTTCCTTGCCATCCCAACTATGTTTGGTTATAGTGGACT 

TAAGGCGGTTAAATATTTTTTAGATGGTAACGTCTTGAGTTTAGACCAATC 

TTTAATACTTTTAGTAGCAAGTCTGACAGCTTTCGTAGTTAGTTTATATGTT 

ATTCGTTTCTTGACAGACTATGTCAAACGACACGATTTCACCATCTTTGGT 



AAGTATCGTATAGTCTTAGGAAGTTTACTCATCCTCTACTGGTTAGTTGTTC 
ATTTATTCTAA 

(SEQIDNO: 212) 

WLKVVIACIPSIE1ALPFDNWFEAHFNFM1PIAIALIFYGFVFIWVEKRNAHLKP 
QVTELASMSYKTAFLIGCFQVLSIVPGTSRSGATILGAIIIGTSRSVAADFTFFLA 
IPTMFGYSGLKAVKYFLDGNVLSLDQSLILLVASLTAFVVSLYVIRFLTDYVKR 
HDFTIFGKYRIVLGSLLILYWLVVHLF* 



Sequence description: 

A] Length: 579 bp - 193 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-68 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-68 gene sequence 
described in WO 00/06736. N-terminus has yet to be determined. 

ID- 184 

Clone 3-1 lc (ID- 121c) 
(SEQIDNO: 213) 

ATGGAAATGAAACAAATCAGTGAAACAACACTGAAAATTACAATTAGTAT 

GGAAGATTTAGAAGATCGTGGTATGGAGCTGAAAGATTTCCTAATCGCTCA 

GGAGAAGACTGAGGAATTTTTCTATTCTGTCATGGATGAATTAGACTTGCC 

AGAAAACTTTAAAAATAGTGGTATGTTAAGTTTTCGAGTAACACCTAAAA 

AAGATCGCATTGATGTTTTTGTTACAAAGTCTGAATTAAGTAAAGATTTAA 

ATTTAGAAGAATTAGCAGATTTGGGTGACATTTCAAAAATGTCTCCAGAAG 

ACTTTTTTAAAACCTTGGAACAATCGATGTTGGAAAAAGGGGATACGGAT 

GCCCATGCCAAATTAGCAGAAATTGAAAATATGATGGATAAAGCAACTCA 

AGAAGTAGTTGAGGAAAATGTTTCTGAAGAACAACCTGAAAAGGAAGTAG 

AAACGATTGGATATGTTCACTATGTCTTTGATTTTGATAATATTGAAGCTGT 

AGTTGGATTTTCACAAACGATTGATTTTCCAATAGAAGCTT 

(SEQ ID NO: 214) 

MEMKQISETTLKITISMEDLEDRGMELKDFLIPQEKTEEFFYSVMDELDLPENF 
KNSGMLSFRVTPKKDRIDVFVTKSELSKDLNLEELADLGDISKMSPEDFFKTLE 
QSMLEKGDTDAHAKLAEIENMMDKATQEVVEENVSEEQPEKEVETIGYVHY 
VFDFDNIEAVVRFSQTIDFPIEA 



Sequence description: 



A] Length: 547 bp - 182 aa (Partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-68 gene which was identified by LEEP, 

5 during cloning and sequence analysis of the full-length ID-68 gene sequence. 

ATG start codon is preceded by an typical 
Shine-Dalgarno sequence. No obvious potential 
leader peptide 
sequence 

10 

ID- 185 

Clone 3- 16b (ID- 122b) 

15 

(SEQ ID NO: 215) 

GGAAACCAACGGCCAGTACAATCGTCAAGGGTAGATTATCCTAAACGTAG 

TCGTGCCAAGATTGTAGAAGTTTATTTTAGACAAGCTTCTACTACTGATTA 

TTCTGGTGTTTACAAAGGTTACTATATTGACTTTGAAGCCAAAGAAACCCG 

20 GCAGAAAACTGCTATGCCTATGAAAAATTTTCATGCTCACCAAATAGAGC 
ACATGGCAAATGTATTACAGCAAAAAGGGATTTGCTTTGTCTTGCTTCATT 
TTTCCACACTTAAGGAAACCTATCTACTCCCTGCTAATGAGTTAATTTCATT 
TTATCAGATTGATAAAGGCAATAAATCAATGCCTATTGATTATATCAGAAA 
AAATGGATTTTTCGTAAAGGAGAGTGCCTTTCCTCAAGTCCCTTACTTAGA 

25 TATTATTGAAGAAAAATTATTAGGCGGTGATTACAATTAA 

(SEQ ID NO: 216) 

GNQRPVQSSRVDYPKRSRAKIVEVYFRQASTTDYSGVYKGYYIDFEAKETRQ 
KTAMPMKNFHAHQIEHMANVLQQKGICFVLLHFSTLKETYLLPANELISFYQI 
30 DKGNKSMPTDYIRKNGFFVKESAFPQVPYLDIIEEKLLGGDYN* 



Sequence description: 

35 A] Length: 447 bp - 149 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 122 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID- 122 gene sequence. N- 
terminus has yet to be determined 

40 

ID-186 



Clone 3- 17b (ID- 123b) 



(SEQIDNO:217) 

GGATCCTAAAAACGCTAAGGTTTATCAAAAAAATGCTGATCAATTTAGTG 
ACAAGGCAATGGCTATTGCAGAGAAGTATAAGCCAAAATTTAAAGCTGCA 
5 AAGTCTAAATACTTTGTGACTTCACATACAGCATTCTCATACTTAGCTAAG 
CGATACGGATTGACTCAGTTAGGTATTGCAGGTGTCTCAACCGAGCAAGA 
ACCTAGTGCTAAAAAATTAGCCGAAATTCAGGAGTTTGTGAAAACATATA 
AGGTTAAGACTATTTTTGTTGAAGAAGGAGTCTCACCTAAATTAGCTCAAG 
CAGTAGCTTCAGCTACTCGAGTTAAAATTGCAAGTTTAAGTCCTTTAGAAG 
1 0 C AGTTCCC AAA AAC AAT AA AG ATTACTTAG A AA ATTTGG A A ACT A ATCTTA 
AGGTACTTGTCAAATCGTTAAATCAATAG 

(SEQIDNO: 218) 

DPKNAKVYQKNADQFSDKAMAIAEKYKPKFKAAKSKYFVTSHTAFSYLAKR 
1 5 YGLTQLGIAGVSTEQEPS AKKLAEIQEFVKTYKVKTIFVEEGVSPKLAQAVAS 
ATRVKIASLSPLEAVPKNNKDYLENLETNLKVLVKSLNQ* 



Sequence description: 

20 

A] Length: 433 bp - 144 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 123 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 123 gene sequence. 

25 N-terminus has yet to be determined 



ID- 187 

30 Clone 3-46/47 (ID- 130b) 
(SEQIDNO: 219) 

ATGAAAAAAGTCATCGATTTAAAAAAACTACAAAAAGCATACGCCTCAGA 
AACTGTTTTAAATAATATTAATTTGGAGGTGTTTAAAGGAGAAATAATTGG 

35 ATTAATAGGACCCTCTGGAGCAGGGAAATCTACCTTGATTAAAACTATGCT 
TGGCATGGAAAAAGCAGATAAGGGAACAGCTCTTGTTCTTGATACTCAAA 
TGCCAGATCGTAATATTTTAAATCAAATTGGCTATATGGCTCAATCTGATG 
CCTTACACGAGTCTTTAACTGGCTTAGAAAATTTATTATTCTTTGGAAAAA 
- TGAAAGGTATTCAA AAAAGTGAATTAAA AC AGCAGAT AAGTC ATATTTCT 

40 AAAGTAGTAGATCTAGAAAACCAACTTGATAAATTTGTCTCAGGTTACTCA 
GAAGGTATGAAAAGACGGCTTTCTCTAGCCATCGCCCTACTTGGAAACCCC 
ACAGTTTTAATCCTAGATGAACCTACCGTTGGAATTGATCCATCCTTGAGG 
AGAAAAATCTGGCAAGAGCTAATTAATATTAAGGATGAAGGACGTTCTAT 
CTTTATTACAACCCACGTTATGGATGAAGCAGAATTAACAAGTAAGGTTGC 



ACTACTATTACGTGGAAACATTATTGCCTTTGATACTCCATTACATTTAAA 

AAAACAATTTAATGTGAGTACTATTGAGGAAGTTTTCTTAAAAGCTGAAGG 

AGAATAA 

(SEQ ID NO: 220) 

MKKVIDLKKLQKAYASETVLNNINLEVFKGEIIGLIGPSGAGKSTLIKTMLGME 

KADKGTALVLDTQMPDRNILNQIGYMAQSDALHESLTGLENLLFFGKMKGIQ 

KTELKQQITHISKVVDLENQLDKFVSGYSEGMKRRLSLAIALLGNPTVLILDEP 

TVGIDPSLRRKIWQELINIKDEGRSIFITTHVMDEAELTSKVALLLRGNIIAFDTP 

LHLKKQFNVSTEEEVFLKAEGE* 



Sequence description: 

A] Length: 717 bp - 239 aa (Possible full-length sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 130 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID- 130 gene sequence. ATG 
start codon is preceded by a possible 

Shine-Dalgarno. No obvious potential leader 
peptide sequence 

ID- 188 

Clone 3-83b (ID- 144b) 
(SEQ ID NO: 221) 

ATGGTACAAATGATACATGATATGATTAAAACAATTGAGCATTTTGCTGAG 

ACACAAGCTGATTTTCCAGTGTATGATATTTTAGGGGAAGTCCATACTTAT 

GGACAACTTAAAGTAGACTCTGACTCTCTAGCTGCTCATATTGATAGCCTA 

GGCCTTGTTGAAAAATCACCTGTCTTAGTATTCGGTGGTCAAGAATATGAA 

ATGTTGGCGACATTTGTTGCTTTAACAAAGTCAGGGCATGCTTATATACCG 

GTTGACCAACACTCTGCTTTGGATAGAATACAGGCTATTATGACAGTTGCT 

CAACCAAGCCTTATCATTTCAATTGGTGAATTTCCTCTTGAAGTTGATAAT 

GTCCCAATCCTAGACGTTTCTCAAGTTTCAGCTATTTTTGAAGAAAAGACT 

CCTTATGAGGTAACACATTCTGTTAAAGGTGATGATAATTACTATATTATT 

TTCACTTCAGGGACTACTGGTTTACCAAAAGGTGTGCAAATTTCACATGAC 

AATTTATTGAGCTTTAGAAATTGGATGATTTCTGATGATGAGTTTTGAGTTG 

CTGAAAGACCGCAAATGTTGGCTCAACCC 

(SEQ ID NO: 222) 

MVQMIHDMIKTffiHFAETQADFPVYDILGEVHTYGQLKVDSDSLAAHIDSLGL 
VEKSPVLVFGGQEYEMLATFVALTKSGHAYIPVDQHSALDRIQAIMTVAQPSL 



IISIGEFPLEVDNVPILDVSQVSAIFEEKTPYEVTHSVKGDDNYYIIFTSGTTGLP 
KGVQISHDNLLSFTNWMISDDEFSVPERPQMLAQP 



Sequence description: 

A] Length: 592 bp - 197 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 144 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-144 gene sequence. 
Putative ATG start codon is preceded by a 

typical Shine-Dalgarno sequence. No obvious 

leader peptide sequence 

This orf is not in frame with nuc 



ID-189 

Clone 3-86b (ID- 145b) 
(SEQ ID NO: 223) 

ATGGAAAATCATCGTTATGAAGATGAAGGTAAATTCCAGCGTAAGATGAC 

CAGTCGTCATCTCTTTATGTTATCGCTAGGTGGTGTTATCGGGACTGGGCTT 

TTCTTGAGTTCAGGTTATACCATTGCACAGGCTGGTCCGCTTGGAGCTGTG 

CTGTCTTATTTGATTGGTGCCGTTGTGGTTTATTTGGTCATGCTATCACTTG 

GGGAATTGGCGGTTGCCATGCCGGTGACGGGGTCATTCCACACTTATGCCA 

CTAAGTTTATCAGTCCTGGAACAGGTTTTACTGTTGCTTGGCTATATTGGAT 

TTGTTGGACGGTCGCCTTGGGGACTGAATTTTTAGGTGCTGCCATGCTGAT 

GCAGCGCTGGTTCCCAAATGTGCCGGCTTGGGCATTTGCTTCCTTTTTTGCC 

CTTGTGATTTTTGGTTTAAATGCTCTTAGCGTACGCTTTTTTGCAGAAGCAG 

AGTCTTTCTTCTCAAGTATTAAGGTTATTGCTATCATTATCTTTATTATCTTG 

GGCTTAGGTGCTATGTTTGGTCTAGTTTCCTTTGAAGGTCAGGACAAGGCT 

ATTCTCTTCACTCATCTGACTGCCAATGGTGCCTTTCCAAATGGTATCGTTG 

CAGTTGTCTCAGTCATGTTGGCTGTTAACTATGCCTTCTCTGGTACTGAGTT 

AATTGGTATTGCGGCTGGTGAAACGGATAATCCCAAAGAAGCTGTACCAA 

GGGCTATTAAAACGACAATCGGTCGCTTGGTTGTTTTCTTTGTACTGACAA 

TTGTTGTCCTAGCTTCGCTATTGCCAATGAAAGAGGCAGGCGTATCCACAG 

CACCATTOGTTGATGTCTTTGACAAGATGGGAATGCCTTTTACGGCGGATA 

TCATGAACTTCGTTATCTTGACAGCCATCCTGTCTGCTGGTAACTCAGGTCT 

CTACGCATCAAGCCGTATGCTCTGGTCCCTTGCCAATGAAGGTATGTTGTC 

AAAATCTGTTGTGAAAATCAATAAACACGGTGTCCCAATGCGTGCTCTTCT 

CTTGTCAATGGCAGGAGCAGTGCTGTCGCTCTTTTCAAGTATTTACGCTGC 



AGACACAGTTTATCTAGCCTTGGTTTCAATCGCGGGCTTTGCTGTTGTTGTC 
GTATGGCTAGCCATTCCAGTCGCACAAATCAATTTCCGCAAGGAATTC 

(SEQ ID NO: 224) 

MENHRYEDEGKFQRKMTSRHLFMLSLGGVIGTGLFLSSGYTIAQAGPLGAVL 

SYLIGAVVVYLVMLSLGELAVAMPVTGSFHTYATKFISPGTGFTVAWLYWIC 

WTVALGTEFLGAAMLMQRWFPNVPAWAFASFFALVIFGLNALSVRFFAEAES 

FFSSIKVIAniFIILGLGAMFGLVSFEGQHKAILFTHLTANGAFPNGIVAVVSVM 

LAVNYAFSGTELIGIAAGETDNPKEAVPRAIKTTIGRLVVFFVLTIVVLASLLPM 

KEAGVSTAPFVDVFDKMGIPFTADIMNFVILTAILSAGNSGLYASSRMLWSLA 

NEGMLSKSVVKINKHGVPMRALLLSMAGAVLSLFSSIYAADTVYLALVSIAGF 

AVVVVWLAIPVAQINFRKEF 



Sequence description: 

A] Length: 1 126 bp - 393 aa (partial gene 
sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 145 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 145 gene sequence. 
Putative ATG start codon is preceded by a 

typical Shine-Dalgarno sequence. Possesses a 
possible leader peptide sequence. 

ID-190 
Clone 3-94b 
(SEQ ID NO: 225) 

TCAGAAAATGCAGAGGCAGCAACGGTTGCCACAAACTTGGTTACCAAAGG 

AGCTAATGTCATTATCGGACCAGCAACATCGGGTGCAGCTGCATCTTCAAC 

TCCAAAAGTAAATGCAGCAGCAGTTCCAATGATTGCACCTGCTGCGACAC 

AAGACAATTTAGTCTATGGTTCTGATGGAAAAACCTTAAATCAGTATTTCT 

TCCGAGCTACTTTTGTCGATAATTATCAAGGAAAGCTATTGTCTCAGTATG 

CTACAGACAACCTTAAAGCTAAAAAAGTTGTTCTATTTTATGATAATTCAT 

CAGATTACTGAAAGGGGGTAGGAAAATCATTTAAGGAAAGTTATAGTGGA 

AAAATTGTTGATAGTATGACATTCTCCGCTGGTGATACTGATTTCCAAGCG 

TCATTGACTAAGTTGAAAGGGAAAGAATATGATGCTATTGTGATGCCAGG 

TTACTATACCGAGACAGGATTAATAGTTAAGCAAGCGCGTGATTTAGGTAT 

CTCTAAACCGGTTCTTGGGCCTGATGGTTTTGATAGTCCGAAATTTGTGCA 



ATCGGCAACACCTGTAGGAGCTTCAAACGTTTATTATTTGACAGGTTTCAC 
TACACAAGGATCAACCAAAGCTAAAGCT 

(SEQ ID NO: 226) 

5 SENAEAATVATNLVTKGANVnGPATSGAAASSTPKVNAAAVPMIAPAATQD 
NLVYGSDGKTLNQYFFRATFVDNYQGKLLSQYATDNLKAKKVVLFYDNSSD 
YSKGVAKSFKESYSGKIVDSMTFSAGDTDFQASLTKLKGKEYDAIVMPGYYT 
ETGLIVKQARDLGISKPVLGPDGFDSPKFVQSATPVGASNVYYLTGFTTQGST 
KAKA 

10 

Sequence description 



15 A] Length: 637 bp - 231 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-149 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-149 gene sequence. N- and 
C-termini have yet to be determined 

20 

ID-191 

Clone 2-c94b (ID- 153b) 

25 

(SEQ ID NO: 227) 

TTGGGACTTAAAGACCATGCTTTAGTCTATCCATTTTCATTATCTGGGGGG 
CAAAAGCAACGTGTCGCACTAGCTCGTGCGATGATGATTGATCCACAGATT 
ATTGGTTATGATGAGCCAACTAGCGCTCTTGATCCAGAGTTGCGTCAAGAA 
30 GTAGAAAAACTAATTTTACAAAATAGAGAAACAGGTATGACACAAATTGT 
AGTAACACATGATCTTCAATTTGCTGAAAGTATATCTGATACGATTCTCAA 
AATTAATCCTAAGTAG 

(SEQ ID NO: 228) 

35 MGLKDHALVYPFSLSGGQKQRVALARAMMIDPQIIGYDEPTSALDPELRQEV 
EKLILQNRETGMTQIVVTHDLQFAESISDTILKINPK* 



Sequence description - -- 

40 

A] Length: 270 bp - 90 aa (partial sequence) 



B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 153 gene which was identified by LEEP, during 
cloning and sequence analysis of the ID- 153 gene sequence. 
N-terminus has yet to be determined 

5 

ID- 192 

Clone 2-clb(ID-155b) 

10 

(SEQ ID NO: 229) 

ATGACTAATATCTCAGATGTTCCAAAAGCTATTAGAACACAGGCACAGTAT 
GTTCTCTTGGGAATGAGAGTTATGGATCAGTCGGTATTACCGAAAACATAT 
AATTCAAAAGAACCTTATTTGAAACCAGATATGATTTATATTCATGATAGA 
1 5 AG AC A AG AG AC A ATGCTTA AAATC ACTC A AG A AATAG AAATGG AGG ATTG 
A 

(SEQ ID NO: 230) 

MTNISDVPKAIRTQAQYVLLGMRVMDQSVLPKTYNSKEPYLKPDMIYIHDRR 
20 QETMLKITQEIEMEH* 



Sequence description 

25 

A] Length: 204 bp - 68 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 155 gene which was identified by LEEP, during 
cloning and sequence analysis of the ID- 155 gene sequence. 

30 ATG start codon is preceded by a potential typical Shine-Dalgarno sequence. 

Has a 

typical leader peptide. N-terminus has yet to be 
verified 

35 

ID-193 

Clone 2-54altb (ID- 172b) 



40 (SEQ ID NO: 231) 

AAGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCTTGGGGAATATAAATT 
TGGATTTCATGACGATGTAAAGCCAATTTATTCTACGGGAAAAGGTCTAAA 
TGAGGCTGTTATTCGTGAGTTATCTGCAGCTAAGGGTGAACCTGAGTGGAT 
GTTGGACTTTCGTCTAAAATCCTTGGAAACGTTTAATAAAATGCCGATGCA 



GACCTGGGGAGCAGATTTATCAGATATTGATTTTGATGATATTATTTATTA 
TCAAAAAGCATCTGATAAACCTGCGCGTGATTGGGATGATGTTCCAGAAA 
AAATCAAAGAAACTTTTGAAAGAATTGGGATTCCAGAAGCTGAAAGAGCC 
TATCTTGCAGGAGCATCAGCACAATATGAATCAGAAGTAGTTTATCACAAT 
5 ATGAAAGAAGAATATGATAAGCTGGGTATTGTTTTTACGGATACTGACTCT 
GCACTTAAAGAGTACCCAGAGCTATTCAAAAAATATTTTGCTAAACTTGTC 
CCTCCAACAGATAATAAATTAGCTGCTCTGAACTCTGCTGTATGGTCAGGT 
GGAACATTTATTTATGTTCCTAAAGGTGTTAAGGTGGATATTCCACTTCAA 
ACTTACTTCCGTATTAATAATGAAAATACTGGACAATTTGAACGTACTCTC 

1 0 ATTATTGTTGATGAGGGAGC AAGTGTTCACTATGTTGA AGGTTGTACCGCC 
CCAACTTATTCTTCAAATAGTTTACATGCAGCTATAGTTGAAATTTTTGCAC 
TTGATGGAGCTTATATGCGCTATACGACTATTCAAAATTGGTCCGATAATG 
TCTATAATTTAGTGACAAAACGTGCTACCGCTAAAAAAGATGCAACAGTT 
GAGTGGATAGATGGAAATCTAGGAGCTAAAACAACAATGAAATACCCATC 

1 5 GGTTTACCTTGATGGTGAAGGAGCACGTGGC ACGATGTTGTCTATTGCTTT 
TGCAAACAAAGGACAACACCAAGATACGGGTGCAAAGATGATTCATAATG 
CCCCCCATACTAGTTCATCCATTGTCTCTAAATCAATTGCTAAGGGTGGGG 
GAAAAGTTGATTATCGAGGTCAAGTGACATTTAATAAAGATTCCAAAAAA 
TCAGTGTCACATATAGAATGTGACACCATATTGATGGATGATATTTCAAAA 

20 TCAGATACCATACCGTTTAATGAGATTCATAATTCACAGGTTGCTTTAGAG 
CATGAAGCAAAGGTGTCTAAGATTTCTGAAGAGCAACTGTACTACTTGATG 
AGTCGAGGTTTATCTGAAGCTGAAGCAACAGAAATGATTGTTATGGGGTTT 
GTTGAGCCCTTTACGAAAGAATTACCAATGGAATATGCGGTAGAGTTAAA 
TCGTTTAATTTCCTATGAAATGGAAGGTTCAGTTGGTTAA 

25 

(SEQ ID NO: 232) 

MHACRSTLEDLGEYKFGFHDDVKPIYSTGKGLNEAVIRELSAAKGEPEWMLD 

FRLKSLETFNKMPMQTWGADLSDIDFDDIIYYQKASDKPARDWDDVPEKIKE 

TFERIGIPEAERAYLAGASAQYESEVVYHNMKEEYDKLGIVFTDTDSALKEYP 

30 ELFKKYFAKLVPPTDNKLAALNSAVWSGGTFIYVPKGVKVDIPLQTYFRINNE 
NTGQFERTLIIVDEGASVHYVEGCTAPTYSSNSLHAAIVEIFALDGAYMRYTTI 
QNWSDNVYNLVTKRATAKKDATVEWIDGNLGAKTTMKYPSVYLDGEGARG 
TMLSIAFANKGQHQDTGAKMIHNAPHTSSSIVSKSIAKGGGKVDYRGQVTFN 
KDSKKSVSHIECDTILMDDISKSDTIPFNEIHNSQVALEHEAKVSKISEEQLYYL 

35 MSRGLSEAEATEMIVMGFVEPFTKELPMEYAVELNRLISYEMEGSVG* 



Sequence description: 

40 A] Length: 1411 bp - 469 aa (Possible full-length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-72 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ED-72 gene sequence. 
No obvious Shine Dalgarno sequence upstream of 



TTG start codon insufficient sequence data). N 
terminus needs verification. 



ID- 194 

Clone 3- lb (ID-81b) 
(SEQ ID NO: 233) 

ATGATAGAATTCTTTTCTAATATCAGAACAGAGATTCCGCAGATGCCTTTA 

CTTATCCATAGTTTGATTTTATCTGTCTTACCTTTTCTGATGTGGCTGACTTT 

GGTTAATAGAGATAAGCCTTTGTATAAAACTATTTGGAGTATCCTTTTAGG 

ACTTCAGTTAATTACGATTTATACTTGGTTTTTCTGGGCAAAATTGCCTTTA 

TCTGAAAGTCTTCCCCTTTACCATTGTCGAATAGGCATGTTTGTGGGTCTCT 

TA 

(SEQ ID NO: 234) 

MIEFFSNIRTEIPQMPLLIHSLILSVLPFLMWLTLVNRDKPLYKTiWSILLGLQLI 
TIYTWFFWAKLPLSESLPLYHCRIGMFVGLL 



Sequence description 

A) Length: 261 bp - 87 aa (partial gene sequence) 

B) This gene sequence was not identified using the LEEP system. It was identified 
downstream of the ID-81 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-81 gene sequence. Sequence Characteristics: 
Possesses a potential leader peptide sequenceOrf is preceded by a potential Shine- 
Dalgarno sequence. 

ID-195 

Clone RS-55b 
(SEQ ID NO: 235) 

AAGCTTGTGCAAAGTATTAAAGAGATAGGATTAGCTAATGCGCATTTATTA 

GCTGTTGCTCCGACAGGGTCAATCAGTTATCTTTCTTCTTGTACTCCGAGCC 

TTCAACCGGTTGTATCACCTGTCGAAGTACGCAAGGAAGGAGCACTGGGG 

AGGGTTTATGTAGCTGCTTATAAGATTGATGCAGATAATTATGTGTAGTAC 

AAAAAAGGAGCTTATGAAGTGGGATCTGAGGCGATTATCAATATTGCAGC 

TGCCGCTCAAAAACACATTGATCAAGCTATTTCGTTAACGCTTTTCATGAC 

AGATCAAGCAACTACGCGAGATTTAAATAAAGCCTATATTCAAGCATTTA 

AACAAAAATGTGCCTCTATTTATTATGTACGAGTGAGACAGGACATCCTAG 



AAGGTAGCGAGAGTTATGATGATATGCTGGATGATTTCACTTCATCGGACT 
TAGAAGACTGTCAATCCTGCATGATTTAA 

(SEQ ID NO: 236) 

KLVQSIKEIGLANAHLLAVAPTGSISYLSSCTPSLQPVVSPVEVRKEGALGRVY 
VAAYKIDADNYVYYKKGAYEVGSEAIINIAAAAQKHIDQAISLTLFMTDQATT 
RDLNKAYIQAFKQKCASIYYVRVRQDILEGSESYDDMLDDFTSSDLEDCQSC 
MI* 



Sequence description: 

A] Length 486 bp - 162 aa (Partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was identified 
upstream of the ID-87 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-87 gene sequence. N-terminus to be 
determined. 



ID-196 

Clone RS-59(ID-90b) 
(SEQ ID NO: 237) 

GTGAGGACATATATTACAAACTTGAATGGACATTCAATCACTAGTACAGC 

ACAAATAGCTCAAAACATGGTAACAGATATAGCAGTAAGCTTAGGTTTTC 

GTGAGCTGGGAATACATTCTTATCCGATTGATACTGATTCTCCTGAGGAAA 

TGAGTAAGCGTTTAGATGGAATCTGTTCCGGACTTAGAAAAAATGATATTG 

TCATATTTCAGACACCTACATGGAACACTACAACTrTTGATGAAAAATTAT 

TTCACAAATTAAAAATATTTGGTGTAAAGATTGTTATTTTTATACATGATGT 

TGTACCGCTAATGTTTGATGGAAATTTTTATTTGATGGATAGAACTATAGC 

TTATTATAATGAAGCAGATGTTTAATAGCCCCTAGTCAAGCAATGGTCGAT 

AAGCTT 

(SEQ ID NO: 238) 

MRTYITNLNGHSrTSTAQIAQNMVTDIAVSLGFRELGIHSYPIDTDSPEEMSKRL 
DGICSGLRKNDIVIFQTPTWNTTTFDEKLFHKLKIFGVKrVIFIHDVVPLMFDGN 
FYLMDRTIAYYNEADVLIAPSQAMVDKL 



Sequence description: 

A] Length: 414 bp - 138 aa(partial gene) 



B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-90 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-90 gene sequence. 
No obvious signal peptide, but a 
5 possible Shine Dalgarno sequence is present 

upstream of ATG start codon. C-terminus has yet 
to be determined. 



10 ID-197 

Clone RS-59c (ID-90c) 
(SEQ ID NO: 239) 

1 5 C ATGGA AATGA AGTTG ATG ATGTTATTAG A AGGGCATTTG AAT AT AATC AC 
CTTATCTTTGCTTTTGATAATACCTGTCATAACAGAGAGTTAGTATTAGATA 
GCAATATCATTTCTCACACAACCTGTGAACAATTGATAAATTTAATGAAAA 
ATTTATCAGGCTCCATTATGTATTTGCTAGAGCAACAAAGAGAACAAACA 
AGTAATGAAACAAAAGAGCGTTATAAAGAAATATTAGGAGGGTATGGAA 

20 ATGCCTAA 

(SEQ ID NO: 240) 

HGNEVDDVIRRAFEYNHLIFAFDNTCHNRELVLDSNIISHTTCEQLINLMKNLS 
GSIMYLLEQQREQTSNETKERYKEILGGYGNA* 

25 

Sequence description: 

A] Length: 261 bp - 87 aa(partial gene sequence) 
30 B] This gene sequence was not identified using the LEEP system. It was 

identified upstream of the ID-90 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-90 gene sequence. N- 
terminus has yet to be determined 

35 

ID-198 

Clone RS-70b (ID-93b) 

40 (SEQ ID NO: 241) 

ACATTTTTATATTATGTATTTGAAGACGTAGCCACCCAGTCAAATATGACT 
GGGAAGATTTTTAGTATGTCTAAAGAAGAGTTGTCATATTTACCCGTTATT 
AAACTTTTTAAGAATCAAGGTGTATACAACGGCTTGATTGGTCTATTCCTC 
CTTTATGGGTTATATATTTCACAGAATCAAGAAATTGTAGCTATTTTTTTAA 



TCAATGTGTTGCTAGTTGCTGTTTATGGTGCTTTGACAGTTGATAAAAAAA 
TCTTATTAAAACAGGGTGGTTTACCTATATTAGCTCTTTTAACATTCTTATT 
TTAA 

5 (SEQ ID NO: 242) 

TFLYYVFEDVATQSNMTGKIFSMSKEELSYLPVIKLFKNQGVYNGLIGLFLLY 
GLYISQNQEIVAIFLINVLLVAVYGALTVDKKILLKQGGLPILALLTFLF* 



10 Sequence description: 

A] Length: 312 bp - 104 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It 
was identified upstream of the ID-93 gene which was identified by 

15 LEEP, during cloning and sequence analysis of the full-length ED-93 

gene sequence. 

N-terminus has yet to be determined 

20 ID- 199 

Clone RS-70c (ID-93c) 
(SEQ ID NO: 243) 

25 ATGAAATTAAGTGTCCTTGATTATGGGCTTATTGATTATGGAAAAACTGCA 
AGTGATGCAATACAAGAAACGATTCTTTTATCACAAGAGGCGGAGCAACT 
AGGCTATCATCAATTTTGGGTGGCTGAACATCACGGTGTTAAGGCATTCAG 
TATTAGCAATCCAGAATTAATGATAATGCATTTGGCTAACCAGACTAAATC 
TATCAAAATTGGCTCTGGAGGTATAATGCCTCTGCACTATAGTAGTTTTAA 

30 ACTCGCGGAGACTCTCAAGACATTAGAGACATGTCATCCGAATCGAGTAA 
GTATTGGTTTAGGAAATTCACTAGGGACAGTTAAAGTTTCAAATGCACTTC 
GTAGCTTACATAAAGCACATGATTACGAAGAGGTACTGGAGGAATTGAAG 
TCATGGCTTATTGATGAATCATCCAGTAAGGAACCATTAGTTCAACCGACT 
CTTTCTAGCTTCCCAGACTTATATGTGTTGGGGAGTGGTCAAAAATCAGCT 

35 TATTTAGCGGCTAAACTTGGCTTAGGCTTTACCTTCGGTGTTTTTCCTTTTA 
TGGACAAAGACCCATTGACAGAAGCTAAA 

(SEQ ID NO: 244) 

M KLS VLD YGLID YGKT ASD AIQETILLS QE AEQLG YHQFW V AEHHG VKAFSIS 
40 NPELMIMHLANQTKSIKIGSGGIMPLHYSSFKLAETLKTLETCHPNRVSIGLGN 
SLGTVKVSNALRSLHKAHDYEEVLEELKSWLIDESSSKEPLVQPTLSSFPDLYV 
LGSGQKSAYLAAKLGLGFTFGVFPFMDKDPLTEAK 



Sequence description: 

A] Length: 588 bp - 196 aa (partial) 

B] This gene sequence was not identified using the LEEP system. It 
5 was identified downstream of the ID-93 gene which was identified by 

LEEP, during cloning and sequence analysis of the full-length ID-93 
gene sequence. No obvious signal peptide, but Shine Dalgarno 
sequence upstream of the ATG start codon. 
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Figure 3 



nueSl 

BgX II Sco KV 

S * >cg a£acctgatatc tcacaaacagataAeggegtaaatag -3 1 (SEQ ID NO* : 245) 
nucS2 

Bgl II 3ma I 

3 * ~ga agatict: t ^cccwc a^acaaaaaoafcaac<Tqoofcaaafcacr -3 • (SBQ ID NO. : 246) \ 
nucS3 

BqX II Zco RV 

S < "cg aoatctqatatce atcacaaacaoatAaeoQcctaa&tiig -3* (SEQ ID 'NO * 247) 
nucR 

Baa K I 

5 ' - cg gQa^-cc ttatqqaectQaatoaaccrttqtc -3 » (SBQ ID NO.: 248) 

HucSeq 

S'-ggatgctttgtttx^aggtgtafcc -3 1 (SEQ ID NO. : 249) 

pTREPy. 

3 • ~cat^ta*cg^^ccitcaa^t^^t^tca*t^fccc^gca^tg^ 

caatttcacac -3' (SEQ ID NO, : 250) 

5 ■ -gcggat;cx:cc:c^ggct;C»at^atg**fcaaacaet»gt^ 

gt=tatccgcta ~3- (SBQ ID ^ . 2 5l) 

S ' -cgccagggttttcccagtcacgac -3 • (SEQ ID IK). : 252) 

5 ' -tcaggggggcgyagcctatg -3 • (SBQ ID NO. : 253) 

-3. (SBQ ID NO. : 254) 

V 2 

s . _ iccg9ctcgtatgctgcgwgMCCg .3 . (SBQ ID NO. : 255) 



Figure 4 



pTREP-Nuc vectors allow cloning of genomic DNA into each frame 
with respect to the nuclease gene 



P) 



pTREPl-nucl (EcoRV) XAgTATCAGATCT — gATATC - — TCfr C ft C ^SATAACCCCGrYKAAT Fraua«»>l 

:i :!S!t:!l!M A 
(SEQ ID WD. ;256) ? : 5 : : : : 1 ; : 1 : : 

pTR£?l-nud2 <Sm 1) AACTATCAgATCTTC CXCCCCA "* TCACAAACACXTAAGQSCCIAAAJ Fraa*-^2 

(SEQ ID N0« : 257) . . . . s s . . . . . . . * . 5 . . , . t ; : : . , . , 5 , . ; . 1 . . 3 . 

(SEQ ID MO * 25o) 

* 4 » ~ • — — — » — ~ — ■ — ^ AAA Ik tt 

(SEQ ID MX : 259) 

Cloning sito id Indicated bfe an arrow 
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SDS-PAGE analysis of the purified ID-65 and ID-83 protein antigens 
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Figure 7 



SDS-PAGE analysis of the purified ID-89 and ID-96 protein antigens 
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Figure 8 

IgG Titres against the ID-65 and ID-83 Proteins 



ID-65 and ID-83 Vaccinations -IgG Titres 
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Figure 9 
Survival Data 



ID-93 Vaccination- GBS Challenge and Survival 
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Figure 10 



IgG Titres against the ID-93 Protein 
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Figure 13 Southern blot analysis - ID-65 
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IgG Titres against the ID-89 and ID-96 Proteins 



ID-89 and ID-96 Protein Vaccines -IgG Titres 
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Figure 14 Southern blot analysis - ID-89 
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Figure 15 



Southern blot analysis - ID-93 
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Figure 16 Southern blot analysis - ID-96 
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